How are paper authors uniquely identified?













12














Some authors have an ORCID ID so that they can be identified and distinguished from authors with similar names, or tracked across a change of name, a different name format, etc. However, some sources don’t provide the author's ORCID ID (even when one exists), which causes a lot of problems when someone tries to harvest papers from websites with scholarly resources.



I thought that a combination of some author features such as name, email, and affiliation could be enough to distinguish the authors, but I don’t think this is a robust solution.



Is there any way to uniquely identify every author?





























  • 10




    ‘Harvesting papers’ - what, really, are you trying to do?
    – Jon Custer
    Dec 17 at 13:17










  • @Jon Custer Get the metadata of papers provided by different sources.
    – Agelos
    Dec 17 at 17:30






  • 9




    See J. Pfeffer & J. Pfeffer, "Another Article that Makes Bibliometric Analysis a Bit Harder," SIGBOVIK 2015, pp. 79-82. April 1, 2015. *Note: The specific date within that year is not a coincidence; it's a humor venue. PDF page numbers are +8.
    – WBT
    Dec 17 at 18:27











  • Out of curiosity: Have you thought of different spellings of the same name yet? E.g., the German family name Schröder might appear as Schröder (when umlauts are available), Schroeder (the "German" alternative when umlauts are not available) or Schroder (the "Scandinavian" alternative).
    – Sabine
    Dec 17 at 18:33






  • 3




    Related: How are scientific papers uniquely identified?
    – BlueRaja - Danny Pflughoeft
    Dec 17 at 18:36
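The spelling variants raised in the comments above (Schröder, Schroeder, Schroder) can be reduced to one comparison key before any matching is attempted. The following is a minimal sketch, not an established method: the function name `blocking_key`, the transliteration table, and the digraph-collapsing rule are all illustrative assumptions, and the table only covers the German umlauts from the example.

```python
import unicodedata

# Illustrative, non-exhaustive table: German umlauts written out as digraphs.
UMLAUT_DIGRAPHS = {"ä": "ae", "ö": "oe", "ü": "ue"}


def strip_diacritics(text: str) -> str:
    """Remove combining marks, e.g. 'é' becomes 'e'."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def blocking_key(family_name: str) -> str:
    """Map spelling variants of a family name to one coarse key.

    The key is deliberately lossy: it is only meant to find candidate
    pairs, which still need further evidence before being merged.
    """
    # NFC first, so a decomposed 'o' + diaeresis is seen as a single 'ö'.
    lowered = unicodedata.normalize("NFC", family_name).casefold()
    spelled_out = "".join(UMLAUT_DIGRAPHS.get(ch, ch) for ch in lowered)
    key = strip_diacritics(spelled_out)
    # Collapse the digraphs so 'schroeder' and 'schroder' coincide.
    for digraph, vowel in (("ae", "a"), ("oe", "o"), ("ue", "u")):
        key = key.replace(digraph, vowel)
    return key


for variant in ("Schröder", "Schroeder", "Schroder"):
    print(variant, "->", blocking_key(variant))
```

Because the key also merges names that are genuinely different (any name containing "ae", "oe" or "ue" is affected), it should only be used to shortlist candidates, never as an identifier by itself.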















publications digital-researcher-id














edited Dec 17 at 20:08 by TRiG
asked Dec 17 at 9:05 by Agelos

















2 Answers


















24














Aside from ORCID (which far from every paper or person has), there really is no sure-fire way to uniquely identify an author. Using the name becomes problematic with common names (these occur everywhere in the world, but are a particularly common issue in Asia) or name changes (for instance, after marriage). Combining the name with affiliation and e-mail address will also only get you so far, as most academics change universities at least once or twice in their career, and both affiliation and e-mail address tend to change when they do.



For bibliographic research, the most promising approach is probably to combine all of the above with field information (e.g., a Markus Huber publishing in medicine is not particularly likely to be the same person as a Markus Huber publishing in philosophy) and train some sort of heuristic classifier. Clearly, false positives and false negatives will happen, but if your goal is to holistically assess a larger field of research, a few miscategorizations are unlikely to affect the overall picture much.
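To make the idea of combining name, e-mail, affiliation and field more concrete, here is a minimal sketch of a pairwise score. It is not a trained classifier: the `AuthorMention` record, the function `same_author_score`, and every weight below are hypothetical and hand-picked for illustration. In practice the weights would be learned from labelled pairs of author records.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AuthorMention:
    """One author entry as harvested from one paper (hypothetical schema)."""
    name: str
    email: Optional[str] = None
    affiliation: Optional[str] = None
    research_field: Optional[str] = None
    orcid: Optional[str] = None


# (feature, bonus if equal, penalty if different). Hand-picked assumptions:
# e-mail and affiliation change over a career, so a mismatch costs little,
# while a different field is treated as fairly strong counter-evidence.
FEATURE_WEIGHTS = (
    ("email", 0.4, 0.0),
    ("affiliation", 0.2, 0.05),
    ("research_field", 0.2, 0.3),
)
NAME_MATCH_BASELINE = 0.3  # assumed starting score when only the names agree


def _same(a: Optional[str], b: Optional[str]) -> Optional[bool]:
    """Case-insensitive equality, or None if either value is missing."""
    if not a or not b:
        return None
    return a.strip().casefold() == b.strip().casefold()


def same_author_score(a: AuthorMention, b: AuthorMention) -> float:
    """Return a score in [0, 1]; higher means more likely the same person."""
    # If both records carry an ORCID, let it decide outright.
    orcid_match = _same(a.orcid, b.orcid)
    if orcid_match is not None:
        return 1.0 if orcid_match else 0.0
    # Names are compared verbatim here; spelling variants would need
    # to be normalised before this step.
    if not _same(a.name, b.name):
        return 0.0
    score = NAME_MATCH_BASELINE
    for feature, bonus, penalty in FEATURE_WEIGHTS:
        match = _same(getattr(a, feature), getattr(b, feature))
        if match is True:
            score += bonus
        elif match is False:
            score -= penalty
    return max(0.0, min(1.0, score))


medic = AuthorMention("Markus Huber", affiliation="Uni A", research_field="medicine")
philosopher = AuthorMention("Markus Huber", affiliation="Uni B", research_field="philosophy")
print(same_author_score(medic, philosopher))
```

Missing values are treated as "no evidence" rather than as a mismatch, since harvested metadata is often incomplete.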



If your goal is to assess an individual researcher, the most accurate approach is usually to trust the information the researchers themselves maintain (e.g., a CV or a publicly available publication list).


































    8














    This is exactly what ORCID tries to achieve:




    ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. (from their website)




    However, not everybody is aware of this initiative or cares enough to set up an ORCID for themselves. Some journals request ORCIDs upon submission; e.g., Nature Methods requests an ORCID from each corresponding author.
    The problem with using other information to identify researchers is that this information can change, unlike a uniquely assigned number.
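When a source does supply an ORCID, a harvester can at least check that the identifier is well-formed before relying on it. The sketch below assumes the identifier arrives as a bare `NNNN-NNNN-NNNN-NNNX` string (not as a full URL) and uses the ISO 7064 MOD 11-2 check digit that ORCID documents for the final character. The function names are illustrative. A passing check only means the string is well-formed; it does not confirm that the record exists or belongs to this author.

```python
import re

# Four groups of four characters; the last character may be the check digit 'X'.
ORCID_PATTERN = re.compile(r"^[0-9]{4}-[0-9]{4}-[0-9]{4}-[0-9]{3}[0-9X]$")


def orcid_check_digit(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Return True if the string has ORCID shape and a correct check digit."""
    candidate = orcid.strip().upper()
    if not ORCID_PATTERN.match(candidate):
        return False
    digits = candidate.replace("-", "")
    return orcid_check_digit(digits[:15]) == digits[15]


print(is_valid_orcid("0000-0002-1825-0097"))
print(is_valid_orcid("0000-0002-1825-0098"))
```

A check like this catches typos and truncated identifiers in harvested metadata, which is useful because a malformed ORCID would otherwise silently create a "new" author.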
























    • 1




      I am not sure it's required. It might be encouraged. For example: nature.com/articles/s41592-018-0187-8
      – burger
      Dec 17 at 23:46










    • From the link in my answer: 'As part of our efforts to improve transparency in authorship, we request that all corresponding authors of published papers provide their Open Researcher and Contributor Identifier (ORCID) ID, before resubmitting the final version of the manuscript'. I don't know whether this applies to all journals of the Nature Publishing Group, and I also can't say how strictly this rule is enforced. But thanks, I changed "require" to "request"!
      – L_W
      Dec 18 at 6:34














        share|improve this answer










answered Dec 17 at 9:18 by xLeitix (the answer with score 24)





























answered Dec 17 at 9:20, edited Dec 18 at 6:34 by L_W (the answer with score 8)






