Building the German Research Data Infrastructure NFDI – for and with Computer Science

On November 4, 2022, the Joint Science Conference (GWK) selected Schloss Dagstuhl – Leibniz Center for Informatics and the consortium NFDIxCS for federal and state funding within the German National Research Data Infrastructure (NFDI). The consortium will be funded in the double-digit millions of Euros and over a duration of five years. Together with its sister consortium NFDI4DataScience, this is already the second NFDI grant that has been awarded to Schloss Dagstuhl. The NFDI is a collaborative, nationwide network to systematically index, interconnect, and make openly available the valuable stock of data from science and research. Together with 16 other leading partner institutions, Schloss Dagstuhl aims to systematically manage research data in computer science. The main goal of NFDIxCS is to Read more…

Updates to the dblp RDF schema

In the six months since the release of the dblp RDF dump and its persistent snapshot releases, the RDF dump has been downloaded a total of about a thousand times. We are pleased to see that the community is interested in using our semantic data in their research and beyond. However, based on the feedback we have received and the experience we have had experimenting with query interfaces for the data, we have also noticed a number of shortcomings in our schema that we want to address. Therefore, as of September 9, 2022, we are changing a few details about the dblp RDF schema. These changes aim (in part) to make the schema more convenient in practice, e.g., in SPARQL Read more…

OpenAlex integration in dblp

For more than 12 years, the Microsoft Academic Search, later renamed to just Microsoft Academic and eventually to Microsoft Academic Graph (MAG), had been the software giant’s  scholarly bibliographic information service. Despite it being one of the most comprehensive collections across all scientific fields out there, Microsoft obviously never envisioned MAG to be a lasting infrastructure for the community, but rather a playground for certain internal research and technology projects. Consequently, when Microsoft decided to pull the plug and end MAG in 2021, it came to the dismay of many researchers who were relying on MAG as a data source. At this point, the fine people at OurResearch, who were already providing the very useful Unpaywall service, stepped in and Read more…

dblp in RDF

For more than 20 years, a full dump of all dblp records in our own XML format has been available as open data for download and reuse. These dump files have always been in high demand over the years (with 500+ downloads in February 2022 alone) and are used as a research dataset in numerous publications. For quite some time now, we have been asked to provide a full RDF dump as well. Snapshots of the dblp XML file have been converted to RDF before by members of the community, and there are still a number of those RDF files available on the Internet. However, the problem with these snapshots is that they are usually not updated once they are Read more…

Results and learnings from the dblp User Survey 2020/2021

From December 2020 to January 2021, we asked you to participate in an online survey in order to help us understand how researchers are using dblp, and how dblp and its features are perceived by the public. The response exceeded our expectations: We received the amazingly high number of 1046 responses to our survey in total, with 760 surveys fully answered and 286 surveys partially completed. This response provided us with a plethora of helpful remarks, constructive criticisms, novel ideas, and feature requests that will guide us in the development of dblp in the upcoming years. We would like to express our heartfelt gratitude for your kind support of dblp! In this blog post, we will share with you the Read more…

We are hiring!

Interested in working on the world’s most comprehensive open indexing service in computer science? We are currently looking for three additional members to join the dblp team. This includes the opportunity to qualify yourself for a doctorate in the context of the challenges arising from managing and operating a semantic research information infrastructure like dblp. For more information, please read our current job openings.

Schloss Dagstuhl becomes part of the National Research Data Infrastructure for Data Science and Artificial Intelligence

On July 2, 2021, the German Joint Science Conference (Gemeinsame Wissenschaftskonferenz, GWK) decided to fund the National Research Data Infrastructure (NFDI) consortium for Data Science and Artificial Intelligence (NFDI4DataScience) with an amount in the double-digit millions over a duration of five years. In this consortium, Schloss Dagstuhl has joined forces with numerous other leading research infrastructure providers in Germany. The NFDI is a collaborative, nationwide network to systematically index, interconnect, and make openly available the valuable stock of data from science and research. Dagstuhl’s renown research infrastructures – supporting research itself, the publication and dissemination of research results, and finding and reusing them – will be further developed, expanded, and integrated as part of NFDI. The results of this consortium Read more…

New dblp URL scheme and API updates

A big change has just been made to the dblp website … and, in case we did our job right, you may even haven’t noticed yet: With the latest update, we introduced major changes to the dblp URL scheme. In particular, this applies to the URLs of all author bibliographies listed on dblp, which are now served under a new and persistent URL. But don’t worry, just like the first time we made such a change about eight years ago, we try to keep all previously existing URLs as a redirect for the foreseeable future. In this post, we talk about the reasons that made us abandon our old URL scheme and why you will most likely want to update Read more…

dblp and ORCID 2020

In the past, we often discussed how helpful ORCIDs are for our work. An ORCID (Open Researcher and Contributor ID) is a unique personal identifier that scientists can attach to their work. The ORCID ensures that this work is linked to the correct scientist an not to someone else with the same or similar name. We at dblp use ORCIDs to create clean bibliographies. A bibliography should list the work of a single researcher and of course a unique identifier is very helpful here. In this post I will give a short overview on how we handle ORCID and how prevalent it is in DBLP just now. If you do not have an ORCID, consider getting one (for free) at orcid.org. Please make sure that it is attached to your publications whenever possible.

We started experimenting with ORCID in 2016. A more complex integration began in 2017 when we also started to show ORCIDs in bibliographies and individual publications. At the same time we made ORCIDs available with our data releases. We obtain most ORCIDs directly from the publishers together with other publication meta data such as title and author names. ORCID was established in 2012 and many publishers started to attach ORCIDs to their publications only recently (or do not do that at all). But authors can claim such works on their own. This information is provided by ORCID via their annual data dump which we also map to our data set. This means that ORCID has become a common type of data in our collection. Below you see the fraction of signatures in dblp for which an ORCID is known. A signature is a pair of author name and paper. So a paper with five authors has five signatures.

Fraction of signatures with ORCID

An ORCID is now available for 12% of all our signatures and that number is going up. At the moment, we add ORCIDs to dblp in batches. This means that a publication can appear in dblp without any ORCIDs. A few days later they are added. We are working to streamline this process for a faster integration.

(more…)