What page replacement algorithms are used in Linux kernel for OS file cache?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
5
down vote

favorite
6












Linux uses the unused portions of memory for file caching, and it cleans up the space when needed.



My question is about how it picks a victim page for replacement?
There are various algorithms (LRU, FIFO, LFU and random replacement)



I'd like to know
1) What page replacement algorithms are used in Linux kernel for OS file cache?



2) If possible, I'd like to know how it have evolved over time in Linux kernel. I assume that its algorithm and implementation might change overtime considering "reasonable" changes in trends. How can I find those? Do I need to read kernel source codes?










share|improve this question





















  • I'd suggest starting here, kernel.org and seeing if you can find the detail in the various documents and sources.
    – EightBitTony
    May 9 '16 at 9:05














up vote
5
down vote

favorite
6












Linux uses the unused portions of memory for file caching, and it cleans up the space when needed.



My question is about how it picks a victim page for replacement?
There are various algorithms (LRU, FIFO, LFU and random replacement)



I'd like to know
1) What page replacement algorithms are used in Linux kernel for OS file cache?



2) If possible, I'd like to know how it have evolved over time in Linux kernel. I assume that its algorithm and implementation might change overtime considering "reasonable" changes in trends. How can I find those? Do I need to read kernel source codes?










share|improve this question





















  • I'd suggest starting here, kernel.org and seeing if you can find the detail in the various documents and sources.
    – EightBitTony
    May 9 '16 at 9:05












up vote
5
down vote

favorite
6









up vote
5
down vote

favorite
6






6





Linux uses the unused portions of memory for file caching, and it cleans up the space when needed.



My question is about how it picks a victim page for replacement?
There are various algorithms (LRU, FIFO, LFU and random replacement)



I'd like to know
1) What page replacement algorithms are used in Linux kernel for OS file cache?



2) If possible, I'd like to know how it have evolved over time in Linux kernel. I assume that its algorithm and implementation might change overtime considering "reasonable" changes in trends. How can I find those? Do I need to read kernel source codes?










share|improve this question













Linux uses the unused portions of memory for file caching, and it cleans up the space when needed.



My question is about how it picks a victim page for replacement?
There are various algorithms (LRU, FIFO, LFU and random replacement)



I'd like to know
1) What page replacement algorithms are used in Linux kernel for OS file cache?



2) If possible, I'd like to know how it have evolved over time in Linux kernel. I assume that its algorithm and implementation might change overtime considering "reasonable" changes in trends. How can I find those? Do I need to read kernel source codes?







linux memory






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked May 9 '16 at 8:44









syko

281411




281411











  • I'd suggest starting here, kernel.org and seeing if you can find the detail in the various documents and sources.
    – EightBitTony
    May 9 '16 at 9:05
















  • I'd suggest starting here, kernel.org and seeing if you can find the detail in the various documents and sources.
    – EightBitTony
    May 9 '16 at 9:05















I'd suggest starting here, kernel.org and seeing if you can find the detail in the various documents and sources.
– EightBitTony
May 9 '16 at 9:05




I'd suggest starting here, kernel.org and seeing if you can find the detail in the various documents and sources.
– EightBitTony
May 9 '16 at 9:05










1 Answer
1






active

oldest

votes

















up vote
6
down vote



accepted










Linux MM does seem to be a bit arcane and challenging to track down.



Linux literature makes heavy mention of LRU (Least Recently Used) in the context of MM (memory management). I haven't noticed any of the other terms being mentioned.



If you're not familiar with how basic LRU is implemented in practice for virtual memory, I came across an interesting introduction (first four paragraphs) in this article on the incomparable LWN.net.



There is a design outline around version 2.6.28-32 here:



http://linux-mm.org/PageReplacementDesign



It suggests Clock-PRO is used for file pages. There is an original paper available on it. There is an old description of Clock-PRO on LWN.net, again including some practical implementation details. Apparently Clock-PRO "attempts to move beyond the LRU approach", a variant of which is "used in most systems". It seems to put more weight on frequency.



I believe one consideration is that it's not practical to implement LFU replacement. The kernel can't track every single read of a page, not when mmap() is used to access file cache pages (e.g. this is how most programs are loaded in memory), because the performance overhead would be prohibitive.



Linux-mm has another design document for implementing Clock-PRO in Linux. This one doesn't talk about it being merged in; it was written a few months before the LWN article about it. http://linux-mm.org/ClockProApproximation



More recent descriptions are that Linux merely "uses some ideas" from Clock-PRO, and is actually a "mash of a number of different
algorithms with a number of modifications for catching corner cases and
various optimisations".



The above quote was answered a question by Adrian McMenamin. McMenamin went on to complete an MSc project in 2011, testing modifications to Linux page replacement based on the "working set model". It includes a brief description of Linux page replacement. The "variant of LRU" is named as "the 2Q [two-queue] approach for database management", a number of references are provided, and there is a diagram illustrating movement between the two queues and other state transitions. He also describes Linux as using a partial implementation of CLOCK-PRO.



I expect the LRU concept, as opposed to the other possibilities you mention, was established from the start. And the most significant change was the introduction of Clock-PRO based features, i.e. putting some more weight on frequency.



In 2013 Linux gained "thrash detection-based file cache sizing". This is probably also relevant to the question.






share|improve this answer






















  • Great thanks to you! you guided me through this. I am going to start from 'PageReplacementDesign' page. 'thrash detection-based file cache sizing' looks also very interesting.
    – syko
    May 9 '16 at 13:49










Your Answer







StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "106"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: false,
noModals: false,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













 

draft saved


draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f281974%2fwhat-page-replacement-algorithms-are-used-in-linux-kernel-for-os-file-cache%23new-answer', 'question_page');

);

Post as a guest






























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes








up vote
6
down vote



accepted










Linux MM does seem to be a bit arcane and challenging to track down.



Linux literature makes heavy mention of LRU (Least Recently Used) in the context of MM (memory management). I haven't noticed any of the other terms being mentioned.



If you're not familiar with how basic LRU is implemented in practice for virtual memory, I came across an interesting introduction (first four paragraphs) in this article on the incomparable LWN.net.



There is a design outline around version 2.6.28-32 here:



http://linux-mm.org/PageReplacementDesign



It suggests Clock-PRO is used for file pages. There is an original paper available on it. There is an old description of Clock-PRO on LWN.net, again including some practical implementation details. Apparently Clock-PRO "attempts to move beyond the LRU approach", a variant of which is "used in most systems". It seems to put more weight on frequency.



I believe one consideration is that it's not practical to implement LFU replacement. The kernel can't track every single read of a page, not when mmap() is used to access file cache pages (e.g. this is how most programs are loaded in memory), because the performance overhead would be prohibitive.



Linux-mm has another design document for implementing Clock-PRO in Linux. This one doesn't talk about it being merged in; it was written a few months before the LWN article about it. http://linux-mm.org/ClockProApproximation



More recent descriptions are that Linux merely "uses some ideas" from Clock-PRO, and is actually a "mash of a number of different
algorithms with a number of modifications for catching corner cases and
various optimisations".



The above quote was answered a question by Adrian McMenamin. McMenamin went on to complete an MSc project in 2011, testing modifications to Linux page replacement based on the "working set model". It includes a brief description of Linux page replacement. The "variant of LRU" is named as "the 2Q [two-queue] approach for database management", a number of references are provided, and there is a diagram illustrating movement between the two queues and other state transitions. He also describes Linux as using a partial implementation of CLOCK-PRO.



I expect the LRU concept, as opposed to the other possibilities you mention, was established from the start. And the most significant change was the introduction of Clock-PRO based features, i.e. putting some more weight on frequency.



In 2013 Linux gained "thrash detection-based file cache sizing". This is probably also relevant to the question.






share|improve this answer






















  • Great thanks to you! you guided me through this. I am going to start from 'PageReplacementDesign' page. 'thrash detection-based file cache sizing' looks also very interesting.
    – syko
    May 9 '16 at 13:49














up vote
6
down vote



accepted










Linux MM does seem to be a bit arcane and challenging to track down.



Linux literature makes heavy mention of LRU (Least Recently Used) in the context of MM (memory management). I haven't noticed any of the other terms being mentioned.



If you're not familiar with how basic LRU is implemented in practice for virtual memory, I came across an interesting introduction (first four paragraphs) in this article on the incomparable LWN.net.



There is a design outline around version 2.6.28-32 here:



http://linux-mm.org/PageReplacementDesign



It suggests Clock-PRO is used for file pages. There is an original paper available on it. There is an old description of Clock-PRO on LWN.net, again including some practical implementation details. Apparently Clock-PRO "attempts to move beyond the LRU approach", a variant of which is "used in most systems". It seems to put more weight on frequency.



I believe one consideration is that it's not practical to implement LFU replacement. The kernel can't track every single read of a page, not when mmap() is used to access file cache pages (e.g. this is how most programs are loaded in memory), because the performance overhead would be prohibitive.



Linux-mm has another design document for implementing Clock-PRO in Linux. This one doesn't talk about it being merged in; it was written a few months before the LWN article about it. http://linux-mm.org/ClockProApproximation



More recent descriptions are that Linux merely "uses some ideas" from Clock-PRO, and is actually a "mash of a number of different
algorithms with a number of modifications for catching corner cases and
various optimisations".



The above quote was answered a question by Adrian McMenamin. McMenamin went on to complete an MSc project in 2011, testing modifications to Linux page replacement based on the "working set model". It includes a brief description of Linux page replacement. The "variant of LRU" is named as "the 2Q [two-queue] approach for database management", a number of references are provided, and there is a diagram illustrating movement between the two queues and other state transitions. He also describes Linux as using a partial implementation of CLOCK-PRO.



I expect the LRU concept, as opposed to the other possibilities you mention, was established from the start. And the most significant change was the introduction of Clock-PRO based features, i.e. putting some more weight on frequency.



In 2013 Linux gained "thrash detection-based file cache sizing". This is probably also relevant to the question.






share|improve this answer






















  • Great thanks to you! you guided me through this. I am going to start from 'PageReplacementDesign' page. 'thrash detection-based file cache sizing' looks also very interesting.
    – syko
    May 9 '16 at 13:49












up vote
6
down vote



accepted







up vote
6
down vote



accepted






Linux MM does seem to be a bit arcane and challenging to track down.



Linux literature makes heavy mention of LRU (Least Recently Used) in the context of MM (memory management). I haven't noticed any of the other terms being mentioned.



If you're not familiar with how basic LRU is implemented in practice for virtual memory, I came across an interesting introduction (first four paragraphs) in this article on the incomparable LWN.net.



There is a design outline around version 2.6.28-32 here:



http://linux-mm.org/PageReplacementDesign



It suggests Clock-PRO is used for file pages. There is an original paper available on it. There is an old description of Clock-PRO on LWN.net, again including some practical implementation details. Apparently Clock-PRO "attempts to move beyond the LRU approach", a variant of which is "used in most systems". It seems to put more weight on frequency.



I believe one consideration is that it's not practical to implement LFU replacement. The kernel can't track every single read of a page, not when mmap() is used to access file cache pages (e.g. this is how most programs are loaded in memory), because the performance overhead would be prohibitive.



Linux-mm has another design document for implementing Clock-PRO in Linux. This one doesn't talk about it being merged in; it was written a few months before the LWN article about it. http://linux-mm.org/ClockProApproximation



More recent descriptions are that Linux merely "uses some ideas" from Clock-PRO, and is actually a "mash of a number of different
algorithms with a number of modifications for catching corner cases and
various optimisations".



The above quote was answered a question by Adrian McMenamin. McMenamin went on to complete an MSc project in 2011, testing modifications to Linux page replacement based on the "working set model". It includes a brief description of Linux page replacement. The "variant of LRU" is named as "the 2Q [two-queue] approach for database management", a number of references are provided, and there is a diagram illustrating movement between the two queues and other state transitions. He also describes Linux as using a partial implementation of CLOCK-PRO.



I expect the LRU concept, as opposed to the other possibilities you mention, was established from the start. And the most significant change was the introduction of Clock-PRO based features, i.e. putting some more weight on frequency.



In 2013 Linux gained "thrash detection-based file cache sizing". This is probably also relevant to the question.






share|improve this answer














Linux MM does seem to be a bit arcane and challenging to track down.



Linux literature makes heavy mention of LRU (Least Recently Used) in the context of MM (memory management). I haven't noticed any of the other terms being mentioned.



If you're not familiar with how basic LRU is implemented in practice for virtual memory, I came across an interesting introduction (first four paragraphs) in this article on the incomparable LWN.net.



There is a design outline around version 2.6.28-32 here:



http://linux-mm.org/PageReplacementDesign



It suggests Clock-PRO is used for file pages. There is an original paper available on it. There is an old description of Clock-PRO on LWN.net, again including some practical implementation details. Apparently Clock-PRO "attempts to move beyond the LRU approach", a variant of which is "used in most systems". It seems to put more weight on frequency.



I believe one consideration is that it's not practical to implement LFU replacement. The kernel can't track every single read of a page, not when mmap() is used to access file cache pages (e.g. this is how most programs are loaded in memory), because the performance overhead would be prohibitive.



Linux-mm has another design document for implementing Clock-PRO in Linux. This one doesn't talk about it being merged in; it was written a few months before the LWN article about it. http://linux-mm.org/ClockProApproximation



More recent descriptions are that Linux merely "uses some ideas" from Clock-PRO, and is actually a "mash of a number of different
algorithms with a number of modifications for catching corner cases and
various optimisations".



The above quote was answered a question by Adrian McMenamin. McMenamin went on to complete an MSc project in 2011, testing modifications to Linux page replacement based on the "working set model". It includes a brief description of Linux page replacement. The "variant of LRU" is named as "the 2Q [two-queue] approach for database management", a number of references are provided, and there is a diagram illustrating movement between the two queues and other state transitions. He also describes Linux as using a partial implementation of CLOCK-PRO.



I expect the LRU concept, as opposed to the other possibilities you mention, was established from the start. And the most significant change was the introduction of Clock-PRO based features, i.e. putting some more weight on frequency.



In 2013 Linux gained "thrash detection-based file cache sizing". This is probably also relevant to the question.







share|improve this answer














share|improve this answer



share|improve this answer








edited Sep 22 at 13:53

























answered May 9 '16 at 10:54









sourcejedi

20.3k42887




20.3k42887











  • Great thanks to you! you guided me through this. I am going to start from 'PageReplacementDesign' page. 'thrash detection-based file cache sizing' looks also very interesting.
    – syko
    May 9 '16 at 13:49
















  • Great thanks to you! you guided me through this. I am going to start from 'PageReplacementDesign' page. 'thrash detection-based file cache sizing' looks also very interesting.
    – syko
    May 9 '16 at 13:49















Great thanks to you! you guided me through this. I am going to start from 'PageReplacementDesign' page. 'thrash detection-based file cache sizing' looks also very interesting.
– syko
May 9 '16 at 13:49




Great thanks to you! you guided me through this. I am going to start from 'PageReplacementDesign' page. 'thrash detection-based file cache sizing' looks also very interesting.
– syko
May 9 '16 at 13:49

















 

draft saved


draft discarded















































 


draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f281974%2fwhat-page-replacement-algorithms-are-used-in-linux-kernel-for-os-file-cache%23new-answer', 'question_page');

);

Post as a guest













































































Popular posts from this blog

How to check contact read email or not when send email to Individual?

How many registers does an x86_64 CPU actually have?

Nur Jahan