Available Data Formats

In accordance with the open data policy pursued byAbes since 2012, the metadata produced by the alert networks managed byAbes are subject to the Open Licence of the State drawn up by the Etalab mission. The retrieval and reuse of the data is free of charge, provided that the date of retrieval and the source are indicated. The "conditions of use" are detailed for each data set.

Go to the Sudoc

Provision of data Sudoc in MARC formats


Export formats

Important: In order to follow the evolution of theMARCformats, changes to the export formats are regularly implemented:

  • The UNIMARC export format is updated as the international format evolves. To keep track of updates to the export formats, vendors and Library Management System (LMS) administrators are invited to refer to the tracking document (see below and on this page).
  • The MARC21 export format is based on the 1999 version of USMARC. Only a few evolutions have been or can be, punctually, taken into account. The data Sudoc in MARC21 are therefore provided in a structure that may not be up to date with the latest international developments.

Bibliographic records Sudoc are provided in an exchange format that conforms to ISO 2709 (ICSNo. 34.240.30), the international standard that defines the format for the computer exchange of bibliographic records:

The UNIMARC format is the reference for national and international data exchange. Read more about UNIMARC

Sample records Sudoc

Institutions and Library Management System providers have a sample of data at their disposal Sudocto allow them to test the interoperability of systems with Sudoc and thus ensure smooth data exchange. More information

Terms and conditions of use

To comply with data source citation requirements:

  • UNIMARC: retain the content of fields 801$b and 801$c
  • MARC 21: keep the contents of field 040$a

Examples of reuse

  • member libraries of the Sudoc / Sudoc-PS networks: feeding local systems (BMS, discovery tools...)
  • documentary structures outside the network Abes: enrichment of local systems (DBMS, discovery tools, etc.)
Read more

Data exposure Sudoc in MARC XML


Bibliographic records Sudoc are available in UNIMARC/MARCXML format

Note: the data provided in XML is converted "on the fly" from the UNIMARC export database, so that it is guaranteed to be up-to-date

Methods of recovery

    • per unit: from the record identifier Sudoc (n°PPN)
    • query syntax: https://www.sudoc.fr/[enter the pppn without the square brackets].xml

Terms and conditions of use

To comply with data source citation requirements, at least 801$b and 801$c should be kept, e.g. :

  <datafield tag="801" ind1=" " ind2="3">
  <subfield code="a">FR</<span">subfield>
  <subfield code="b">Abes</subfield>
  <subfield code="c">20210217</subfield>
  <subfield code="g">AFNOR</subfield>

Some UNIMARC fields/subfields cannot be converted to XML. These are mainly :

  • fields from external data sources whose providers do not allow exposure, e.g. fields 100 and 101 (publication dates/language) of a serial record identified in the ISSN Register which are subject to validation by CIEPS
  • some areas not exposed due to the complexity of their modelling or the difference in granularity with UNIMARC

Examples of reuse

  • Single unit data Sudoc recovery in a more manageable format than ISO 2709
  • reuse of data Sudoc as a bibliographic repository
  • aggregation of records in distinct formats using XML as a pivot format, which allows for example the aggregation of data produced within different cataloguing networks (data from Sudoc, from Calames or from theses.fr)

more: Enabling the UNIMARC/MARCXML web service

Read more

Data exposure Sudoc in RDF


In line withAbes 's policy of exposing data Sudoc on the Web of Data, bibliographic records Sudoc can be retrieved in RDF format. As the data provided in RDF is converted "on the fly" from the UNIMARC export database, it is guaranteed to be up-to-date on a daily basis

Methods of recovery

  • per unit: from the record identifier Sudoc (PPN number)
    • query syntax: https://www.sudoc.fr/[enter the pppn without the square brackets].rdf

Terms and conditions of use

In order to comply with the data sourcing requirements, the following elements should be mentioned in the file header:

  • dcterms:creator rdf:resource="http://www.idref.fr/033702462/id"/: identifies theAbes (PPN of the authority record " Abes ")
  • dcterms:created: date of creation of the record Sudoc
  • dcterms:modified: date of modification of the notice Sudoc

Limitations

Some UNIMARC fields/subfields cannot be converted to RDF. These are mainly :

  • certain fields from external data sources whose suppliers do not allow exposure, such as fields 100 and 101 (publication dates/language) of a serial identified in the ISSN Register which are subject to validation by CIEPS
  • some areas not exposed due to the complexity of their modelling or the difference in granularity with UNIMARC.

For more information: see the documentation UNIMARC - RDF Correspondences

Examples of reuse

  • Single unit data Sudoc recovery in a more manageable format than ISO 2709
  • reuse of data Sudoc as a bibliographic repository
  • aggregation of records in distinct formats using RDF as a pivot format, which allows for example the aggregation of data produced within different cataloguing networks (data from Sudoc, de Calames or theses.fr)
Read more

Display of CPR records in MARC


The RCR - Resource Centre Directory records describe the documentary institutions that are members of the Sudoc and Sudoc-PS networks. They are structured in a specific MARC format, with precisely described fields. Consult the documentation

These data are enriched with geolocation information, which optimizes their reuse when designing innovative applications or services to enhance library collections.

Available from the Répertoire des Centres de Ressources Sudoc and the Répertoire des Bibliothèques du Catalogue Collectif de France - CCFr, RCR records, as authority records, are freely retrievable from the services IdRef, notably via the triple store data.idref.fr.

Methods of recovery

From theIdRef interface, it is possible to retrieve RCR data in XML format:

  • per unit  From the identifier of a record Sudoc (PPN number).
  • per batch  with the help of the iln2rcr webservice by indicating the identifier of the institution of affiliation (ILN) of the different libraries (RCR). This webservice can cover several ILNs.

Note: the RCR data accessible fromIdRef is also partially exposed in RDF format.

Examples of reuse

  • library directories
  • exploitation of geolocation data associated with CPR records
Read more

The data ecosystem Sudoc


View the presentation: What has become of our data? Data beyond theAbes information system (Journées Abes 2022)