What is the name of an AI system that learns by trial and error?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
3
down vote

favorite












Imagine a system that controls dampers in a complex vent system that has an objective to perfectly equalize the output from each vent. The system has sensors for damper position, flow at various locations and at each vent. The system is initially implemented using a rather small data set or even a formulaic algorithm to control the dampers. What if that algorithm were programmed to "try" different configurations of dampers to optimize the air flows, guided broadly by either the initial (weak) training or the formula? The system would try different configurations and learn what improved results, and what worsened results, in an effort to reduce error (differential outflow).



What is that kind of AI system called? What is that system of learning called? Are there systems that do that currently?










share|improve this question









New contributor




SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1




    You have asked the same question twice here, just with a different title. Please don't do that. We should close one of the questions - please suggest which one.
    – Neil Slater
    Oct 3 at 19:11






  • 1




    There is a subtle difference between the questions. One assumes a more traditional learning process before initial implementation and then refinement of that learning through use. The other question asks about the "trial and error" process where the machine may make random or educated guesses to try and optimize its results.
    – SchroedingersCat
    Oct 3 at 19:41










  • You need to make the difference clearer then. In practice they look the same thing - phrasing like "the system would try different configurations and learn what improved results, and what worsened result" here, and " The algorithm CONTINUES to learn from its own empirical data," in the other question are not really different. It is no surprise that you have been answered with Reinforcement Learning in both cases.
    – Neil Slater
    Oct 3 at 21:08










  • Thank you Neil. I try to make my questions here quick to read and answer if possible, in this case it took two parts to get to the desired result. Through both questions I got the level of detail I needed.
    – SchroedingersCat
    Oct 4 at 13:20














up vote
3
down vote

favorite












Imagine a system that controls dampers in a complex vent system that has an objective to perfectly equalize the output from each vent. The system has sensors for damper position, flow at various locations and at each vent. The system is initially implemented using a rather small data set or even a formulaic algorithm to control the dampers. What if that algorithm were programmed to "try" different configurations of dampers to optimize the air flows, guided broadly by either the initial (weak) training or the formula? The system would try different configurations and learn what improved results, and what worsened results, in an effort to reduce error (differential outflow).



What is that kind of AI system called? What is that system of learning called? Are there systems that do that currently?










share|improve this question









New contributor




SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1




    You have asked the same question twice here, just with a different title. Please don't do that. We should close one of the questions - please suggest which one.
    – Neil Slater
    Oct 3 at 19:11






  • 1




    There is a subtle difference between the questions. One assumes a more traditional learning process before initial implementation and then refinement of that learning through use. The other question asks about the "trial and error" process where the machine may make random or educated guesses to try and optimize its results.
    – SchroedingersCat
    Oct 3 at 19:41










  • You need to make the difference clearer then. In practice they look the same thing - phrasing like "the system would try different configurations and learn what improved results, and what worsened result" here, and " The algorithm CONTINUES to learn from its own empirical data," in the other question are not really different. It is no surprise that you have been answered with Reinforcement Learning in both cases.
    – Neil Slater
    Oct 3 at 21:08










  • Thank you Neil. I try to make my questions here quick to read and answer if possible, in this case it took two parts to get to the desired result. Through both questions I got the level of detail I needed.
    – SchroedingersCat
    Oct 4 at 13:20












up vote
3
down vote

favorite









up vote
3
down vote

favorite











Imagine a system that controls dampers in a complex vent system that has an objective to perfectly equalize the output from each vent. The system has sensors for damper position, flow at various locations and at each vent. The system is initially implemented using a rather small data set or even a formulaic algorithm to control the dampers. What if that algorithm were programmed to "try" different configurations of dampers to optimize the air flows, guided broadly by either the initial (weak) training or the formula? The system would try different configurations and learn what improved results, and what worsened results, in an effort to reduce error (differential outflow).



What is that kind of AI system called? What is that system of learning called? Are there systems that do that currently?










share|improve this question









New contributor




SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











Imagine a system that controls dampers in a complex vent system that has an objective to perfectly equalize the output from each vent. The system has sensors for damper position, flow at various locations and at each vent. The system is initially implemented using a rather small data set or even a formulaic algorithm to control the dampers. What if that algorithm were programmed to "try" different configurations of dampers to optimize the air flows, guided broadly by either the initial (weak) training or the formula? The system would try different configurations and learn what improved results, and what worsened results, in an effort to reduce error (differential outflow).



What is that kind of AI system called? What is that system of learning called? Are there systems that do that currently?







machine-learning training ai-basics terminology






share|improve this question









New contributor




SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|improve this question









New contributor




SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|improve this question




share|improve this question








edited Oct 3 at 19:53









DukeZhou♦

2,92021028




2,92021028






New contributor




SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked Oct 3 at 16:44









SchroedingersCat

454




454




New contributor




SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






SchroedingersCat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







  • 1




    You have asked the same question twice here, just with a different title. Please don't do that. We should close one of the questions - please suggest which one.
    – Neil Slater
    Oct 3 at 19:11






  • 1




    There is a subtle difference between the questions. One assumes a more traditional learning process before initial implementation and then refinement of that learning through use. The other question asks about the "trial and error" process where the machine may make random or educated guesses to try and optimize its results.
    – SchroedingersCat
    Oct 3 at 19:41










  • You need to make the difference clearer then. In practice they look the same thing - phrasing like "the system would try different configurations and learn what improved results, and what worsened result" here, and " The algorithm CONTINUES to learn from its own empirical data," in the other question are not really different. It is no surprise that you have been answered with Reinforcement Learning in both cases.
    – Neil Slater
    Oct 3 at 21:08










  • Thank you Neil. I try to make my questions here quick to read and answer if possible, in this case it took two parts to get to the desired result. Through both questions I got the level of detail I needed.
    – SchroedingersCat
    Oct 4 at 13:20












  • 1




    You have asked the same question twice here, just with a different title. Please don't do that. We should close one of the questions - please suggest which one.
    – Neil Slater
    Oct 3 at 19:11






  • 1




    There is a subtle difference between the questions. One assumes a more traditional learning process before initial implementation and then refinement of that learning through use. The other question asks about the "trial and error" process where the machine may make random or educated guesses to try and optimize its results.
    – SchroedingersCat
    Oct 3 at 19:41










  • You need to make the difference clearer then. In practice they look the same thing - phrasing like "the system would try different configurations and learn what improved results, and what worsened result" here, and " The algorithm CONTINUES to learn from its own empirical data," in the other question are not really different. It is no surprise that you have been answered with Reinforcement Learning in both cases.
    – Neil Slater
    Oct 3 at 21:08










  • Thank you Neil. I try to make my questions here quick to read and answer if possible, in this case it took two parts to get to the desired result. Through both questions I got the level of detail I needed.
    – SchroedingersCat
    Oct 4 at 13:20







1




1




You have asked the same question twice here, just with a different title. Please don't do that. We should close one of the questions - please suggest which one.
– Neil Slater
Oct 3 at 19:11




You have asked the same question twice here, just with a different title. Please don't do that. We should close one of the questions - please suggest which one.
– Neil Slater
Oct 3 at 19:11




1




1




There is a subtle difference between the questions. One assumes a more traditional learning process before initial implementation and then refinement of that learning through use. The other question asks about the "trial and error" process where the machine may make random or educated guesses to try and optimize its results.
– SchroedingersCat
Oct 3 at 19:41




There is a subtle difference between the questions. One assumes a more traditional learning process before initial implementation and then refinement of that learning through use. The other question asks about the "trial and error" process where the machine may make random or educated guesses to try and optimize its results.
– SchroedingersCat
Oct 3 at 19:41












You need to make the difference clearer then. In practice they look the same thing - phrasing like "the system would try different configurations and learn what improved results, and what worsened result" here, and " The algorithm CONTINUES to learn from its own empirical data," in the other question are not really different. It is no surprise that you have been answered with Reinforcement Learning in both cases.
– Neil Slater
Oct 3 at 21:08




You need to make the difference clearer then. In practice they look the same thing - phrasing like "the system would try different configurations and learn what improved results, and what worsened result" here, and " The algorithm CONTINUES to learn from its own empirical data," in the other question are not really different. It is no surprise that you have been answered with Reinforcement Learning in both cases.
– Neil Slater
Oct 3 at 21:08












Thank you Neil. I try to make my questions here quick to read and answer if possible, in this case it took two parts to get to the desired result. Through both questions I got the level of detail I needed.
– SchroedingersCat
Oct 4 at 13:20




Thank you Neil. I try to make my questions here quick to read and answer if possible, in this case it took two parts to get to the desired result. Through both questions I got the level of detail I needed.
– SchroedingersCat
Oct 4 at 13:20










3 Answers
3






active

oldest

votes

















up vote
1
down vote



accepted










I believe "Reinforcement Learning" is the term you are looking for (as mentioned by others as well) but keep in mind that the scope of your problem falls under the section of AI that is called Search.



Search algorithms are based upon experimenting with different actions (decisions) and selecting the one that minimizes an arbitrary cost function (reward), given the current and past problem states.






share|improve this answer



























    up vote
    2
    down vote













    Near solution to your problem definition is reinforcement learning. You can define some reward using the objective function and define some possible state space for the machine and finally solve the problem by reinforcement learning techniques (near to trial and error by learning the preferences).






    share|improve this answer



























      up vote
      0
      down vote













      I think any learning algorithm probably uses trial and error and analysis of the results with the ultimate goal of maximizing utility.



      It seems that the recent milestones in AI fall under the general umbrella of machine learning, which includes all forms of reinforcement learning. Essentially, any learning algorithm is using some form of statistical analysis.



      • For an umbrella term, I've been using "learning algorithm"

      However, there is also a venerable history of less capable adaptive systems such as self-organizing networks. (See also optimal control.)






      share|improve this answer






















        Your Answer




        StackExchange.ifUsing("editor", function ()
        return StackExchange.using("mathjaxEditing", function ()
        StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
        StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
        );
        );
        , "mathjax-editing");

        StackExchange.ready(function()
        var channelOptions =
        tags: "".split(" "),
        id: "658"
        ;
        initTagRenderer("".split(" "), "".split(" "), channelOptions);

        StackExchange.using("externalEditor", function()
        // Have to fire editor after snippets, if snippets enabled
        if (StackExchange.settings.snippets.snippetsEnabled)
        StackExchange.using("snippets", function()
        createEditor();
        );

        else
        createEditor();

        );

        function createEditor()
        StackExchange.prepareEditor(
        heartbeatType: 'answer',
        convertImagesToLinks: false,
        noModals: false,
        showLowRepImageUploadWarning: true,
        reputationToPostImages: null,
        bindNavPrevention: true,
        postfix: "",
        noCode: true, onDemand: true,
        discardSelector: ".discard-answer"
        ,immediatelyShowMarkdownHelp:true
        );



        );






        SchroedingersCat is a new contributor. Be nice, and check out our Code of Conduct.









         

        draft saved


        draft discarded


















        StackExchange.ready(
        function ()
        StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fai.stackexchange.com%2fquestions%2f8243%2fwhat-is-the-name-of-an-ai-system-that-learns-by-trial-and-error%23new-answer', 'question_page');

        );

        Post as a guest






























        3 Answers
        3






        active

        oldest

        votes








        3 Answers
        3






        active

        oldest

        votes









        active

        oldest

        votes






        active

        oldest

        votes








        up vote
        1
        down vote



        accepted










        I believe "Reinforcement Learning" is the term you are looking for (as mentioned by others as well) but keep in mind that the scope of your problem falls under the section of AI that is called Search.



        Search algorithms are based upon experimenting with different actions (decisions) and selecting the one that minimizes an arbitrary cost function (reward), given the current and past problem states.






        share|improve this answer
























          up vote
          1
          down vote



          accepted










          I believe "Reinforcement Learning" is the term you are looking for (as mentioned by others as well) but keep in mind that the scope of your problem falls under the section of AI that is called Search.



          Search algorithms are based upon experimenting with different actions (decisions) and selecting the one that minimizes an arbitrary cost function (reward), given the current and past problem states.






          share|improve this answer






















            up vote
            1
            down vote



            accepted







            up vote
            1
            down vote



            accepted






            I believe "Reinforcement Learning" is the term you are looking for (as mentioned by others as well) but keep in mind that the scope of your problem falls under the section of AI that is called Search.



            Search algorithms are based upon experimenting with different actions (decisions) and selecting the one that minimizes an arbitrary cost function (reward), given the current and past problem states.






            share|improve this answer












            I believe "Reinforcement Learning" is the term you are looking for (as mentioned by others as well) but keep in mind that the scope of your problem falls under the section of AI that is called Search.



            Search algorithms are based upon experimenting with different actions (decisions) and selecting the one that minimizes an arbitrary cost function (reward), given the current and past problem states.







            share|improve this answer












            share|improve this answer



            share|improve this answer










            answered Oct 3 at 23:52









            pcko1

            2414




            2414






















                up vote
                2
                down vote













                Near solution to your problem definition is reinforcement learning. You can define some reward using the objective function and define some possible state space for the machine and finally solve the problem by reinforcement learning techniques (near to trial and error by learning the preferences).






                share|improve this answer
























                  up vote
                  2
                  down vote













                  Near solution to your problem definition is reinforcement learning. You can define some reward using the objective function and define some possible state space for the machine and finally solve the problem by reinforcement learning techniques (near to trial and error by learning the preferences).






                  share|improve this answer






















                    up vote
                    2
                    down vote










                    up vote
                    2
                    down vote









                    Near solution to your problem definition is reinforcement learning. You can define some reward using the objective function and define some possible state space for the machine and finally solve the problem by reinforcement learning techniques (near to trial and error by learning the preferences).






                    share|improve this answer












                    Near solution to your problem definition is reinforcement learning. You can define some reward using the objective function and define some possible state space for the machine and finally solve the problem by reinforcement learning techniques (near to trial and error by learning the preferences).







                    share|improve this answer












                    share|improve this answer



                    share|improve this answer










                    answered Oct 3 at 16:57









                    OmG

                    22227




                    22227




















                        up vote
                        0
                        down vote













                        I think any learning algorithm probably uses trial and error and analysis of the results with the ultimate goal of maximizing utility.



                        It seems that the recent milestones in AI fall under the general umbrella of machine learning, which includes all forms of reinforcement learning. Essentially, any learning algorithm is using some form of statistical analysis.



                        • For an umbrella term, I've been using "learning algorithm"

                        However, there is also a venerable history of less capable adaptive systems such as self-organizing networks. (See also optimal control.)






                        share|improve this answer


























                          up vote
                          0
                          down vote













                          I think any learning algorithm probably uses trial and error and analysis of the results with the ultimate goal of maximizing utility.



                          It seems that the recent milestones in AI fall under the general umbrella of machine learning, which includes all forms of reinforcement learning. Essentially, any learning algorithm is using some form of statistical analysis.



                          • For an umbrella term, I've been using "learning algorithm"

                          However, there is also a venerable history of less capable adaptive systems such as self-organizing networks. (See also optimal control.)






                          share|improve this answer
























                            up vote
                            0
                            down vote










                            up vote
                            0
                            down vote









                            I think any learning algorithm probably uses trial and error and analysis of the results with the ultimate goal of maximizing utility.



                            It seems that the recent milestones in AI fall under the general umbrella of machine learning, which includes all forms of reinforcement learning. Essentially, any learning algorithm is using some form of statistical analysis.



                            • For an umbrella term, I've been using "learning algorithm"

                            However, there is also a venerable history of less capable adaptive systems such as self-organizing networks. (See also optimal control.)






                            share|improve this answer














                            I think any learning algorithm probably uses trial and error and analysis of the results with the ultimate goal of maximizing utility.



                            It seems that the recent milestones in AI fall under the general umbrella of machine learning, which includes all forms of reinforcement learning. Essentially, any learning algorithm is using some form of statistical analysis.



                            • For an umbrella term, I've been using "learning algorithm"

                            However, there is also a venerable history of less capable adaptive systems such as self-organizing networks. (See also optimal control.)







                            share|improve this answer














                            share|improve this answer



                            share|improve this answer








                            edited Oct 3 at 20:20

























                            answered Oct 3 at 20:05









                            DukeZhou♦

                            2,92021028




                            2,92021028




















                                SchroedingersCat is a new contributor. Be nice, and check out our Code of Conduct.









                                 

                                draft saved


                                draft discarded


















                                SchroedingersCat is a new contributor. Be nice, and check out our Code of Conduct.












                                SchroedingersCat is a new contributor. Be nice, and check out our Code of Conduct.











                                SchroedingersCat is a new contributor. Be nice, and check out our Code of Conduct.













                                 


                                draft saved


                                draft discarded














                                StackExchange.ready(
                                function ()
                                StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fai.stackexchange.com%2fquestions%2f8243%2fwhat-is-the-name-of-an-ai-system-that-learns-by-trial-and-error%23new-answer', 'question_page');

                                );

                                Post as a guest













































































                                Popular posts from this blog

                                Peggy Mitchell

                                Palaiologos

                                The Forum (Inglewood, California)