Elbow Test using AIC/BIC for identifying number of clusters using GMM

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;







up vote
6
down vote

favorite
1












enter image description here



How to select number of clusters using GMM when the elbow test (AIC/BIC vs n_components) results in a graph like this?










share|cite|improve this question























  • 6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
    – user2974951
    Sep 25 at 8:51
















up vote
6
down vote

favorite
1












enter image description here



How to select number of clusters using GMM when the elbow test (AIC/BIC vs n_components) results in a graph like this?










share|cite|improve this question























  • 6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
    – user2974951
    Sep 25 at 8:51












up vote
6
down vote

favorite
1









up vote
6
down vote

favorite
1






1





enter image description here



How to select number of clusters using GMM when the elbow test (AIC/BIC vs n_components) results in a graph like this?










share|cite|improve this question















enter image description here



How to select number of clusters using GMM when the elbow test (AIC/BIC vs n_components) results in a graph like this?







aic bic generalized-moments






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Sep 25 at 9:13









Ferdi

3,39342151




3,39342151










asked Sep 25 at 8:29









psangam

332




332











  • 6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
    – user2974951
    Sep 25 at 8:51
















  • 6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
    – user2974951
    Sep 25 at 8:51















6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
– user2974951
Sep 25 at 8:51




6 seems to be what you are looking for, if you are looking for a test there are a few, for ex. using clusGap statistic.
– user2974951
Sep 25 at 8:51










1 Answer
1






active

oldest

votes

















up vote
4
down vote



accepted










Welcome to CV!



This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.



Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.



Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.



In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.






share|cite|improve this answer




















    Your Answer




    StackExchange.ifUsing("editor", function ()
    return StackExchange.using("mathjaxEditing", function ()
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    );
    );
    , "mathjax-editing");

    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "65"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    convertImagesToLinks: false,
    noModals: false,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );













     

    draft saved


    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368560%2felbow-test-using-aic-bic-for-identifying-number-of-clusters-using-gmm%23new-answer', 'question_page');

    );

    Post as a guest






























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes








    up vote
    4
    down vote



    accepted










    Welcome to CV!



    This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.



    Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.



    Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.



    In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.






    share|cite|improve this answer
























      up vote
      4
      down vote



      accepted










      Welcome to CV!



      This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.



      Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.



      Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.



      In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.






      share|cite|improve this answer






















        up vote
        4
        down vote



        accepted







        up vote
        4
        down vote



        accepted






        Welcome to CV!



        This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.



        Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.



        Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.



        In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.






        share|cite|improve this answer












        Welcome to CV!



        This plot shows how the AIC and BIC change as a function of the number of clusters. While the AIC continuous to decrease with a larger number of clusters, you can see that the BIC stops decreasing after $k=6$ clusters. For this reason, you could choose $k = 6$.



        Another way to choose the 'best' number of clusters is by considering the elbow(s) of the figure. The elbow of a function is a point after which the decrease becomes notably smaller. An elbow is a heuristic, so there is no exact way to determine which value best describes this point. For example, one could argue that the AIC & BIC both stop decreasing as much after $k = 5$ clusters, while someone else might argue that this is after $k = 6$ clusters. You could even argue that the biggest decrease has already happened after $k = 2$ clusters.



        Lastly, you don't have to choose any number of clusters just because AIC/BIC/whatever suggested you do so. If you have some a priori reason to assume that there should be $k = 3$ clusters, then that might be a better choice.



        In short: An elbow in this context is a heuristic guide to decide the number of clusters if you have no other reason to assume a certain number of clusters.







        share|cite|improve this answer












        share|cite|improve this answer



        share|cite|improve this answer










        answered Sep 25 at 8:53









        Frans Rodenburg

        3,107426




        3,107426



























             

            draft saved


            draft discarded















































             


            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f368560%2felbow-test-using-aic-bic-for-identifying-number-of-clusters-using-gmm%23new-answer', 'question_page');

            );

            Post as a guest













































































            Popular posts from this blog

            How to check contact read email or not when send email to Individual?

            Displaying single band from multi-band raster using QGIS

            How many registers does an x86_64 CPU actually have?