How does it work?

Lantern gives you the ability to pull together a full range of Open Access compliance measures used by funders and official agencies, making checking your publications incredibly easy.

Just upload a list of your identifiers - like DOIs - and we'll learn everything we can about them and send you the results.

Step 1:    Put your identifiers in a spreadsheet. As long as the sheet has one or more of "DOI", "PMID" or "PMCID" in the header row, with the corresponding identifiers underneath it, we can can work with it.

For example:

DOI PMID PMCID
10.abc/123 78543277  
10.jkl/8999/1    
  78024443 PMC75220165
... ... ...

You can download a template here if you like

Step 2:    Save your spreadsheet as a CSV. In Excel you can do this by choosing Save As and selecting CSV (Comma delimited) as the type. For more information see Microsoft's support page.

Step 3:    Sign in to Lantern and upload your CSV via the upload form on your home page.

Step 4:    Wait for us to gather the results for you. You can watch the progress bar or go off and grab a coffee. If you've given us a LOT of identifiers maybe go home and check back tomorrow. Either way, we'll email you with a link to the results when your job is finished.

Step 5:    Download the results (which are also in a CSV) and then open them in your favourite spreadsheet program. Excel will happily open CSVs for you, and allow you to re-save them as .xls files.


You'll have the following information available to you in the spreadsheet:

  • Some additional basic bibliographic information, such as Journal, Publisher, Title, and publication dates
  • Whether the publication is in a repository (and which repositories)
  • Whether the publication appears in EuropePMC, and some information about its status and version
  • What re-use licence the publisher has applied
  • Any embargo information about this publisher
  • Any self-archiving policies for the pre-print, post-print or publisher's version
  • Grant and PI information

If you want the full details of what information we gather, check out the output documentation, below.

In order to get you this information, Lantern queries a lot of external data services, so you should check out our sources page for more information on what we use and the re-use rights they may apply to the data

Output Documentation

When you download your results spreadsheet, there will be a lot of information in there; below you'll find details on what all of those fields mean.

Field Description
PMCID The EuropePMC identifier for the article. You may have provided us this initially when you uploaded your data, but if you didn't we will have tried to populate it for you.
PMID The PubMed identifier for the article. You may have provided us this initially when you uploaded your data, but if you didn't we will have tried to populate it for you.
DOI The DOI for the article. You may have provided us this initially when you uploaded your data, but if you didn't we will have tried to populate it for you.
Publisher The Publisher of the Journal in which the article was published.
Journal title The name of the Journal in which the article was published.
ISSN The ISSN(s) of the Journal in which the article was published; if more than one ISSN is found (e.g. the Print ISSN, Electronic ISSN or Linking ISSN) then they will be presented as a comma-separated list.
Publication Date The date the publisher made the article available (either in print or electronically). The date may contain both the year and month, or only the year, depending on what data is available. If no publication date can be found this column will contain the text "Unavailable".
Electronic Publication Date The date the publisher made the article available online. The date will be in the form YYYY-MM-DD (e.g. 2011-03-21). If no electronic publication date can be found this column will contain the text "Unavailable".
Article title The title of the article.
Author(s) List of the article's authors. First, we look for this information in Europe Pubmed Central. Then we check CrossRef. Finally we check CORE.
In CORE? Whether the article appears in the CORE aggregation. Presence in CORE indicates that some version of the article is available Open Access in a repository or other open database.
Archived Repositories A list of names of repositories in which a copy of some version of the article exists.
Fulltext in EPMC? Does the fulltext of the article appear in some form in EPMC? This is equivalent to the EPMC metadata record asserting inEPMC: Y
XML Fulltext? Does the XML version of the fulltext of the article appear in EPMC? This will be a subset of those where the fulltext appears in some form (some are PDF only), and all OA-subset articles SHOULD have this content available.
AAM? Is the copy of the paper in EPMC the Accepted Authors Manuscript (AAM)? If the XML fulltext is present in EPMC then this information will be lifted from there, otherwise we page-scrape the EPMC web page for the article to detect whether it is the author manuscript
Ahead of Print? Was the article published electronically before it was released in print. For journals which release both electronic and print versions, this will be either TRUE or FALSE. For journals which do not have print versions, this will be "not applicable". If we are unable to determine either the status of the journal or the status of the article this will be "unknown".
Open Access Is the article in the EPMC Open Access subset? This is equivalent to the EPMC metadata record asserting isOpenAccess: Y
Licence

What licence, if any, were we able to detect for the article. Will be one of the Creative Commons licences (including CC0), or "non-standard-licence".

Licence Source

Where did we find the licence information contained in the Licence column?

  1. epmc_rest_api - Detected via the EPMC REST API
  2. epmc_xml_permissions - Detected in the EPMC Fulltext XML under the permissions section
  3. epmc_xml - Detected elsewhere in the EPMC Fulltext XML
  4. epmc - Detected on the EPMC web page (via page scraping)
  5. publisher - Detected on the publisher's page for the article (via page scraping, or publisher API if one exists)
  6. unknown - We were unable to determine a licence for this article
Journal Type

Whether this is a hybrid or pure Open Access journal. Determined by whether the journal is present in the DOAJ

Preprint Embargo

The standard embargo period for preprints for the Journal in which this article is published, according to Sherpa Romeo

Preprint Self-Archiving Policy

The standard self-archiving policy for preprints for the Journal in which this article is published, according to Sherpa Romeo

Postprint Embargo

The standard embargo period for postprints for the Journal in which this article is published, according to Sherpa Romeo

Postprint Self-Archiving Policy

The standard self-archiving policy for postprints for the Journal in which this article is published, according to Sherpa Romeo

Publisher's Copy Embargo

The standard embargo period for publisher copies for the Journal in which this article is published, according to Sherpa Romeo

Publisher's Copy Self-Archiving Policy

The standard self-archiving policy for publisher for the Journal in which this article is published, according to Sherpa Romeo

Grant {X}

Grant Number associated with this article.

This is a repeated column, where {X} is a number used to identify the column, and to associate it with the related "Agency {X}" and "PI {X}" columns

Agency {X}

Agency responsible for funding this article

This is a repeated column, where {X} is a number used to identify the column, and to associate it with the related "Grant {X}" and "PI {X}" columns

PI {X}

The Principal Investigator of the grant associated with this article

This is a repeated column, where {X} is a number used to identify the column, and to associate it with the related "Grant {X}" and "Agency {X}" columns

Correct Article Confidence

What level of confidence does the system have that it has successfully identified and analysed the correct article?

Since there are different ways we might identify the article, we may not always be 100% certain that we have found the correct one. Values in this column are:

  1. 1 - we are certain this is the right article for the identifiers provided. This is because we found an exact match to the PMCID, PMID or DOI
  2. 0.9 - we are almost certain this is the right article, as we identified it by exact title match, and there was only one result
  3. 0.7 - we are pretty sure this is the right article, as we identified it with a title keyword search, and there was only one result
Provenance

This field contains detailed logging and provenance information for the processing run, allowing you to understand on a case-by-case basis how the decisions and analysis were made

Have we missed anything?

Is there some aspect of the documentation that is unclear, or any other questions you have? Don't hesitate to get in touch and we'll be happy to help you out.