From 3120370736ebc5e518d5fa0bfe212f161cd1499a Mon Sep 17 00:00:00 2001
From: Yaroslav Halchenko Vincent Larivière (Canadian Research Chair on the Transformation of Scholarly Communication, University of Montréal) Catriona MacCallum (Director of Open Science, Hindawi Open Access Publisher) Cameron Neylon (Professor of Research Communications, Centre for Culture and Technology, Curtin University) Katherine Skinner (Reasearch Lead, Invest in Open) Katherine Skinner (Research Lead, Invest in Open) Didier Torny (on behalf of the French Open Science Committee) Ludo Waltman (Professor of Quantitative Science Studies and Deputy Director, Centre for Science and Technology Studies (CWTS), Leiden University) [chair] Sequence number:
along with XPath identifiers, discourse elements are identified with a more human-readable sequence number
(e.g. Section n. International Advisory Board for Open
Scope
1, Paragraph n. 3, Table n. 2),
- indentifying their relative position in the document. CCC stores sequence numbers of discourse elements that include at least one in-text reference.
OCI: an OCI is a global persistent identifier of citations. It usually appears in the form oci:<citing>-<cited> where citing and cited are locally assigned numerical identifiers of respectively a citing document and a cited document.
In CCC an OCI is assigned to both the general citation - in the same form <citing>-<cited> - and to every occurrence of an in-text reference in the citing document relevant to that citation.
For instance: the article identified as 0701 in CCC cites the article identified as 07090, and two in-text references appear in the citing article referencing the cited article.
- The general OCI for the citation will be 0701-07090, while the two specific citations instatiated by in-text references will be addressed as 0701-07090/1 and 0701-07090/2 respectively.
0701-07090, while the two specific citations instantiated by in-text references will be addressed as 0701-07090/1 and 0701-07090/2 respectively.
InTRePID: the In-Text Reference Pointer Identifier (InTRePID) is a global unique persistent identifier (PID) of in-text reference pointers.
- InTRePID is an extention of OCI that appears in the following form: intrepid:<oci>/<ordinal>-<total>
+ InTRePID is an extension of OCI that appears in the following form: intrepid:<oci>/<ordinal>-<total>
where <oci> is is the numerical part of the OCI identiying a citation between a citing and cited entity,
<ordinal> is the nth occurrence of an in-text reference pointer within the text of the citing entity
relevant to the cited entity addressed in the OCI, and <total> is the total number of in-text reference pointers
diff --git a/html-template/datasets.html b/html-template/datasets.html
index 47a84a9..04a7810 100644
--- a/html-template/datasets.html
+++ b/html-template/datasets.html
@@ -82,7 +82,7 @@
OpenCitations Index. The Index contain information about the citations themselves, in which the citations, instead of being considered as simple links, are treated as first-class data entities in their own right. The Index do not store metadata about the citing and cited bibliographic entities internally. Rather, these entities are identified in the Index by their unique identifiers, i.e. the OpenCitations Meta Indentifier (OMID), enabling bibliographic information to be retrieved on-the-fly upon request by means of the OpenCitations Meta API. +
OpenCitations Index. The Index contain information about the citations themselves, in which the citations, instead of being considered as simple links, are treated as first-class data entities in their own right. The Index do not store metadata about the citing and cited bibliographic entities internally. Rather, these entities are identified in the Index by their unique identifiers, i.e. the OpenCitations Meta Identifier (OMID), enabling bibliographic information to be retrieved on-the-fly upon request by means of the OpenCitations Meta API.
OpenCitations Meta. OpenCitations Meta stores and delivers bibliographic metadata for all publications involved in the OpenCitations Index.
This page is a legacy page (not linked anymore from the official website) that links all the dumps produced by OpenCitations before October 2023 that are not mantained anymore.
+This page is a legacy page (not linked anymore from the official website) that links all the dumps produced by OpenCitations before October 2023 that are not maintained anymore.
This page details of and links to all the data dumps of the OpenCitations Indexes, the Open Biomedical Citations in Context Corpus and of the OpenCitations Corpus (OCC). They are made available online by means of the support of Figshare and of the Internet Archive.
Each dump of an OpenCitations Index is composed by four zip archives. Two of these archives contains the actual data and provenance information of the index in N-Triples, while the other two archives contain the same information in CSV.
Each dump of the Open Biomedical Citations in Context Corpus is composed by one single zip artchive containing all the information about actual data and provenance stored in JSON-LD.
diff --git a/html-template/home.html b/html-template/home.html index 08a7439..85e30db 100644 --- a/html-template/home.html +++ b/html-template/home.html @@ -85,7 +85,7 @@OpenCitations is an independent not-for-profit infrastructure organization for open scholarship dedicated to the publication of open bibliographic and citation data by the use of Semantic Web (Linked Data) technologies. It is also engaged in advocacy for open citations, particularly in its role as a key founding member of the Initiative for Open Citations (I4OC). For administrative convenience, OpenCitations is managed by the Research Centre for Open Scholarly Metadata at the University of Bologna.
-OpenCitations espouses fully the founding principles of Open Science. It complies with the FAIR data principles by Force11 that data should be findable, accessible, interoperable and re-usable, and it complies with the recommendations of I4OC that citation data in particular should be structured, separable, and open. On the latter topic, OpenCitations has recently published a formal definition of an Open Citation, and has launched a system for globally unique and persistent identifiers (PIDs) for bibliographic citations – Open Citation Identifiers (OCIs).
+OpenCitations espouses fully the founding principles of Open Science. It complies with the FAIR data principles by Force11 that data should be findable, accessible, interoperable and reusable, and it complies with the recommendations of I4OC that citation data in particular should be structured, separable, and open. On the latter topic, OpenCitations has recently published a formal definition of an Open Citation, and has launched a system for globally unique and persistent identifiers (PIDs) for bibliographic citations – Open Citation Identifiers (OCIs).
OpenCitations' involvement in international networks and collaborations, together with the need of identifying and reaching out to new stakeholders to assure OpenCitations' development and sustainability, has made it necessary to define OpenCitations' mission, unique strengths and next developmental steps, summarized in the following publicly available documents: OpenCitations Mission Statement, The Uniqueness of OpenCitations and OpenCitations – Present Status and Future Plans.
diff --git a/html-template/meta.html b/html-template/meta.html index 261a644..deb33c8 100644 --- a/html-template/meta.html +++ b/html-template/meta.html @@ -80,7 +80,7 @@The OpenCitations Meta database stores and delivers bibliographic metadata for all publications involved in the OpenCitations Index.
-For each publication, the metadata exposed by OpenCitations Meta includes the publication's title, type, venue (e.g. journal name), volume number, issue number, page numbers,
For each publication, the metadata exposed by OpenCitations Meta includes the publication's title, type, venue (e.g. journal name), volume number, issue number, page numbers, publication date, and identifiers such as Digital Object Identifiers (DOIs) and PubMed Identifiers (PMIDs). In addition, OpenCitations Meta includes details of the main actors involved in the publication of the document, i.e., the names of the authors, editors, and publishers, each with its own additional metadata and identifier (e.g. ORCID).
Currently, OpenCitations Meta contains:
Every entity in OpenCitations Meta is assigned persistent internal identifier called OpenCitations Meta Identifier (OMID). The OMID has structure [[entity_type_abbreviation]]/[[supplier_prefix]][[sequential_number]].
- For example, the first journal article ever processed has OMID br/0601 (the full URI is https://w3id.org/oc/meta/br/0601), where br is the abbreviation of bibliographic resource, and 060 corresponds to the supplier prefix, helpful in recognising at a glance the index it belongs to (i.e., OpenCitations Meta). Finally, 1 indicates that this is the index's first bibliographic resource ever minted.
br/0601 (the full URI is https://w3id.org/oc/meta/br/0601), where br is the abbreviation of bibliographic resource, and 060 corresponds to the supplier prefix, helpful in recognising at a glance the index it belongs to (i.e., OpenCitations Meta). Finally, 1 indicates that this is the index's first bibliographic resource ever minted.
The entities subject to deduplication and associated with an OMID are identifiers (abbr. id), agent roles (i.e., authors, editors, publishers, abbr. ar),
responsible agents (i.e., people and organisations, abbr. ra), resource embodiments (i.e., pages, abbr. re),
diff --git a/html-template/tools.html b/html-template/tools.html
index 937f484..1eb879f 100644
--- a/html-template/tools.html
+++ b/html-template/tools.html
@@ -90,7 +90,7 @@
OpenCitations created several software applications and libraries used to create all the data stored in the various databases. In particular:
BEE, a software for parsing articles from the Open Access Subset of biomedical literature hosted by Europe PubMed Central (EPMC) and encoded in JATS/XML; we have recently extended it for creating the CCC corpus, which now allows to extract also in-text references and discourse elements (e.g. sentences, paragraphs, sections)
SPACIN, a software for transforming JSON data (output of BEE) into RDF according to the OCDM; we have recently extended it for creating the CCC corpus, so as to reengineer information about in-text references and discourse elements.
SPACIN, a software for transforming JSON data (output of BEE) into RDF according to the OCDM; we have recently extended it for creating the CCC corpus, so as to reengineer information about in-text references and discourse elements.
Create New Citations, a software for creating citation indexes (i.e. doi-to-doi citations) according to OCDM;
BCite, a user-friendly web application - released in beta version - for creating RDF-data according to OCDM from lists of bibliographic references;
The first in-text reference The first in-text reference rp/0701 appears as "Doe et al. 2020". It appears in the firt section de/0701, called "Introduction",
+ rp/0701 appears as "Doe et al. 2020". It appears in the first section de/0701, called "Introduction",
second paragraph de/0702, third sentence de/0703 (being section, paragraph and sentence numbers relative to the entire document and not to the parent element).
Both in-text references and the discourse elements are also identified by a XPath.# the sentence
diff --git a/src/intrepid.py b/src/intrepid.py
index eaf870a..d51efcd 100644
--- a/src/intrepid.py
+++ b/src/intrepid.py
@@ -246,7 +246,7 @@ def validate(self):
if not self.intrepid.startswith("intrepid:"):
self.intrepid = "intrepid:" + self.intrepid
self.add_message("validate", W, "The InTRePID specified as input doesn't start with the 'intrepid:' "
- "prefix. This has beed automatically added, resulting in "
+ "prefix. This has been automatically added, resulting in "
"the InTRePID '%s'." % self.intrepid)
self.is_valid = False
diff --git a/src/oci.py b/src/oci.py
index 51d2d5b..cd1662b 100644
--- a/src/oci.py
+++ b/src/oci.py
@@ -711,7 +711,7 @@ def validate(self):
if not self.oci.startswith("oci:"):
self.oci = "oci:" + self.oci
self.add_message("validate", W, "The OCI specified as input doesn't start with the 'oci:' "
- "prefix. This has beed automatically added, resulting in "
+ "prefix. This has been automatically added, resulting in "
"the OCI '%s'." % self.oci)
self.is_valid = False
diff --git a/src/ramose.py b/src/ramose.py
index dd20bec..276e5f3 100644
--- a/src/ramose.py
+++ b/src/ramose.py
@@ -909,7 +909,7 @@ def conv(s, query_string, c_type="text/csv"):
content_type = Operation.get_content_type(c_type)
- # Overrite if requesting a particular format via the URL
+ # Override if requesting a particular format via the URL
if "format" in query_string:
req_formats = query_string["format"]
@@ -1363,7 +1363,7 @@ def exec(self, method="get", content_type="application/json"):
sc = r.status_code
if sc == 200:
- # This line has been added to avoid a strage behaviour of the 'splitlines' method in
+ # This line has been added to avoid a strange behaviour of the 'splitlines' method in
# presence of strange characters (non-UTF8).
list_of_lines = [line.decode("utf-8") for line in r.text.encode("utf-8").splitlines()]