Is there a fast tool to show all the unique unicode characters in a file and their count?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP












1















Does anybody know a tool to tabulate all the unique unicode characters and their counts in a file?










share|improve this question




























    1















    Does anybody know a tool to tabulate all the unique unicode characters and their counts in a file?










    share|improve this question


























      1












      1








      1


      1






      Does anybody know a tool to tabulate all the unique unicode characters and their counts in a file?










      share|improve this question
















      Does anybody know a tool to tabulate all the unique unicode characters and their counts in a file?







      python unicode coreutils






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited Jan 22 at 19:47









      Rui F Ribeiro

      40k1479135




      40k1479135










      asked Jan 22 at 19:38









      user1424739user1424739

      1061




      1061




















          1 Answer
          1






          active

          oldest

          votes


















          2














          I'm not sure what you mean exactly with "unicode characters". To count the different characters in a file you could do something like this:



          $ awk -v FS="" -v OFS="t" 'for(i=1;i<=NF;i++) char[$i]++ END for(i in char) print i,char[i]' input.txt


          With -v FS="" we set the field separator to nothing. So each character is handled as a single field. In each line we iterate over these fields using the character as a key for the list and increment the count with ++. If all lines were read, we iterate over the counting list and print each key (which represents the character) and its count.






          share|improve this answer

























          • Note that this is going to be locale-dependent (and there are certainly conceptions of "unicode character" that it doesn't satisfy, as you noted).

            – Michael Homer
            Jan 22 at 20:58










          Your Answer








          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "106"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f496044%2fis-there-a-fast-tool-to-show-all-the-unique-unicode-characters-in-a-file-and-the%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          2














          I'm not sure what you mean exactly with "unicode characters". To count the different characters in a file you could do something like this:



          $ awk -v FS="" -v OFS="t" 'for(i=1;i<=NF;i++) char[$i]++ END for(i in char) print i,char[i]' input.txt


          With -v FS="" we set the field separator to nothing. So each character is handled as a single field. In each line we iterate over these fields using the character as a key for the list and increment the count with ++. If all lines were read, we iterate over the counting list and print each key (which represents the character) and its count.






          share|improve this answer

























          • Note that this is going to be locale-dependent (and there are certainly conceptions of "unicode character" that it doesn't satisfy, as you noted).

            – Michael Homer
            Jan 22 at 20:58















          2














          I'm not sure what you mean exactly with "unicode characters". To count the different characters in a file you could do something like this:



          $ awk -v FS="" -v OFS="t" 'for(i=1;i<=NF;i++) char[$i]++ END for(i in char) print i,char[i]' input.txt


          With -v FS="" we set the field separator to nothing. So each character is handled as a single field. In each line we iterate over these fields using the character as a key for the list and increment the count with ++. If all lines were read, we iterate over the counting list and print each key (which represents the character) and its count.






          share|improve this answer

























          • Note that this is going to be locale-dependent (and there are certainly conceptions of "unicode character" that it doesn't satisfy, as you noted).

            – Michael Homer
            Jan 22 at 20:58













          2












          2








          2







          I'm not sure what you mean exactly with "unicode characters". To count the different characters in a file you could do something like this:



          $ awk -v FS="" -v OFS="t" 'for(i=1;i<=NF;i++) char[$i]++ END for(i in char) print i,char[i]' input.txt


          With -v FS="" we set the field separator to nothing. So each character is handled as a single field. In each line we iterate over these fields using the character as a key for the list and increment the count with ++. If all lines were read, we iterate over the counting list and print each key (which represents the character) and its count.






          share|improve this answer















          I'm not sure what you mean exactly with "unicode characters". To count the different characters in a file you could do something like this:



          $ awk -v FS="" -v OFS="t" 'for(i=1;i<=NF;i++) char[$i]++ END for(i in char) print i,char[i]' input.txt


          With -v FS="" we set the field separator to nothing. So each character is handled as a single field. In each line we iterate over these fields using the character as a key for the list and increment the count with ++. If all lines were read, we iterate over the counting list and print each key (which represents the character) and its count.







          share|improve this answer














          share|improve this answer



          share|improve this answer








          edited Jan 22 at 20:09

























          answered Jan 22 at 19:52









          finswimmerfinswimmer

          52416




          52416












          • Note that this is going to be locale-dependent (and there are certainly conceptions of "unicode character" that it doesn't satisfy, as you noted).

            – Michael Homer
            Jan 22 at 20:58

















          • Note that this is going to be locale-dependent (and there are certainly conceptions of "unicode character" that it doesn't satisfy, as you noted).

            – Michael Homer
            Jan 22 at 20:58
















          Note that this is going to be locale-dependent (and there are certainly conceptions of "unicode character" that it doesn't satisfy, as you noted).

          – Michael Homer
          Jan 22 at 20:58





          Note that this is going to be locale-dependent (and there are certainly conceptions of "unicode character" that it doesn't satisfy, as you noted).

          – Michael Homer
          Jan 22 at 20:58

















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Unix & Linux Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f496044%2fis-there-a-fast-tool-to-show-all-the-unique-unicode-characters-in-a-file-and-the%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown






          Popular posts from this blog

          How to check contact read email or not when send email to Individual?

          Bahrain

          Postfix configuration issue with fips on centos 7; mailgun relay