Blog
Name disambiguation suffixes in dblp
At the end of March 2020 dblp provides bibliographies for almost 2.5 million scientists. With this number, it is not surprising that we have namesakes – scientists with the exact same name. For historical reasons, all persons in dblp must have different names. We circumvent this problem by assigning numeric suffixes to names that are not unique. E.g., there are multiple Thomas Müller in dblp. So we name them Thomas Müller 0001, Thomas Müller 0002 and so on. See our FAQ here for more details. Identifying authors with the same name is a very important task. For example, the bibliography of Thomas Müller 0001 should not list papers by another Thomas Müller. This is a very common problem. I recently Read more…