Python: Check if string and its substring are existing in the same list

.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;

I've extracted keywords based on 1-gram, 2-gram, 3-gram within a tokenized sentence

list_of_keywords = 
for i in range(0, len(stemmed_words)):
 temp = 
 for j in range(0, len(stemmed_words[i])):
 temp.append([' '.join(x) for x in list(everygrams(stemmed_words[i][j], 1, 3)) if ' '.join(x) in set(New_vocabulary_list)])
 list_of_keywords.append(temp)

I've obtained keywords list as

['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
['sleep', 'anxiety', 'lack of sleep']

How can I simply the results by removing all substring within the list and remain:

['high blood pressure']
['anxiety', 'lack of sleep']

asked Mar 15 at 9:36

Lisa

918

Will all sub strings be split by a space? What should ['sub', 'string', 'substring'] become?

– Peilonrayz
Mar 15 at 17:30

add a comment |

I've extracted keywords based on 1-gram, 2-gram, 3-gram within a tokenized sentence

list_of_keywords = 
for i in range(0, len(stemmed_words)):
 temp = 
 for j in range(0, len(stemmed_words[i])):
 temp.append([' '.join(x) for x in list(everygrams(stemmed_words[i][j], 1, 3)) if ' '.join(x) in set(New_vocabulary_list)])
 list_of_keywords.append(temp)

I've obtained keywords list as

['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
['sleep', 'anxiety', 'lack of sleep']

How can I simply the results by removing all substring within the list and remain:

['high blood pressure']
['anxiety', 'lack of sleep']

asked Mar 15 at 9:36

Lisa

918

Will all sub strings be split by a space? What should ['sub', 'string', 'substring'] become?

– Peilonrayz
Mar 15 at 17:30

add a comment |

I've extracted keywords based on 1-gram, 2-gram, 3-gram within a tokenized sentence

list_of_keywords = 
for i in range(0, len(stemmed_words)):
 temp = 
 for j in range(0, len(stemmed_words[i])):
 temp.append([' '.join(x) for x in list(everygrams(stemmed_words[i][j], 1, 3)) if ' '.join(x) in set(New_vocabulary_list)])
 list_of_keywords.append(temp)

I've obtained keywords list as

['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
['sleep', 'anxiety', 'lack of sleep']

How can I simply the results by removing all substring within the list and remain:

['high blood pressure']
['anxiety', 'lack of sleep']

asked Mar 15 at 9:36

Lisa

918

I've extracted keywords based on 1-gram, 2-gram, 3-gram within a tokenized sentence

list_of_keywords = 
for i in range(0, len(stemmed_words)):
 temp = 
 for j in range(0, len(stemmed_words[i])):
 temp.append([' '.join(x) for x in list(everygrams(stemmed_words[i][j], 1, 3)) if ' '.join(x) in set(New_vocabulary_list)])
 list_of_keywords.append(temp)

I've obtained keywords list as

['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
['sleep', 'anxiety', 'lack of sleep']

How can I simply the results by removing all substring within the list and remain:

['high blood pressure']
['anxiety', 'lack of sleep']

python nlp

asked Mar 15 at 9:36

Lisa

918

asked Mar 15 at 9:36

Lisa

918

asked Mar 15 at 9:36

Lisa

918

asked Mar 15 at 9:36

Lisa

918

asked Mar 15 at 9:36

Lisa

918

Will all sub strings be split by a space? What should ['sub', 'string', 'substring'] become?

– Peilonrayz
Mar 15 at 17:30

add a comment |

Will all sub strings be split by a space? What should ['sub', 'string', 'substring'] become?

– Peilonrayz
Mar 15 at 17:30

Will all sub strings be split by a space? What should ['sub', 'string', 'substring'] become?

– Peilonrayz
Mar 15 at 17:30

add a comment |

5 Answers
5

active

oldest

votes

You could use this one liner:

b = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
result = [ i for i in b if not any( [ i in a for a in b if a != i] )]

I admit this is O(n²) and maybe will be slow in performance for large inputs.

This is basically a list comprehension of the following:

word_list = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

result = 
for this_word in word_list:
 words_without_this_word = [ other_word for other_word in word_list if other_word != this_word] 
 found = False
 for other_word in words_without_this_word:
 if this_word in other_word:
 found = True

 if not found:
 result.append(this_word)

result

edited Mar 15 at 9:55

Chetan Ameta

6,03822237

answered Mar 15 at 9:42

Christian Sloper

2,776418

1

I believe this will be slightly faster by removing the inner list comprehension, such that it becomes a generator comprehension, like so: result = [i for i in b if not any(i in a for a in b if a != i)]

– beruic
Mar 15 at 12:16

add a comment |

If you have a large list of words, it might be a good idea to use a suffix tree.

Here's a package on PyPI.

Once you created the tree, you can call find_all(word) to get the index of every occurence of word. You simply need to keep the strings which only appear once:

from suffix_trees import STree
# https://pypi.org/project/suffix-trees/
# pip install suffix-trees

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure'] + ['sleep', 'anxiety', 'lack of sleep']
st = STree.STree(words)

st.find_all('blood')
# [0, 20, 26, 46]

st.find_all('high blood pressure')
# [41]

[word for word in words if len(st.find_all(word)) == 1]
# ['high blood pressure', 'anxiety', 'lack of sleep']

words needs to be a unique list of strings, so you might need to call list(set(words)) before generating the suffix-tree.

As far as I can tell, the whole script should run in O(n), with n being the total length of the strings.

edited Mar 15 at 17:14

answered Mar 15 at 17:03

Eric Duminil

41k63472

add a comment |

-1

assuming that order of your elements is from shortest string to longest string, you need to check if each element is substring of last one and then remove it from the list:

symptoms = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']


def removeSubstring(data):
 for symptom in data[:-1]:
 if symptom in data[-1]:
 print("Removing: ", symptom)
 data.remove(symptom)
 print(data)


removeSubstring(symptoms)

answered Mar 15 at 10:07

Anna Janiszewska

Thanks, but the way u suggested would be only workable for 1 longest string, simply tried with symptoms = ['blood', 'sleep', 'high blood pressure', 'lack of sleep']

– Lisa
Mar 15 at 10:15

2

It’s normally a real bad idea to remove things from a list while you are iterating over it.

– Christian Sloper
Mar 15 at 10:26

@ChristianSloper can you elaborate why?

– Anna Janiszewska
Mar 15 at 10:32

quora.com/…

– Christian Sloper
Mar 15 at 10:36

add a comment |

-1

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

superset_word = ''
#print (words)
for word in words:
 word_list_minus_word = [each for each in words if word != each]
 counter = 0
 for other_word in word_list_minus_word:
 if (other_word not in word):
 break
 else:
 counter += 1
 if (counter == len(word_list_minus_word)):
 superset_word = word
 break
print(superset_word)

answered Mar 15 at 10:09

Venfah Nazir

10914

This does not work on OP's second example

– Christian Sloper
Mar 15 at 10:45

add a comment |

-3

grams = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

unique_grams = [grams[i] for i in range(len(grams)) if not grams[i] in ' '.join(grams[i+1:])]

edited Mar 15 at 18:40

Vasilis G.

4,0452924

answered Mar 15 at 12:21

Jawad Ali Khan

It doesn't seem to work. For example with grams = ['a b c', 'b c', 'a', 'b', 'c'].

– Eric Duminil
Mar 15 at 17:07

add a comment |

Your Answer

StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);

);

draft saved

draft discarded

StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55179517%2fpython-check-if-string-and-its-substring-are-existing-in-the-same-list%23new-answer', 'question_page');

);

Post as a guest

Name

Required, but never shown

5 Answers
5

active

oldest

votes

5 Answers
5

active

oldest

votes

You could use this one liner:

b = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
result = [ i for i in b if not any( [ i in a for a in b if a != i] )]

I admit this is O(n²) and maybe will be slow in performance for large inputs.

This is basically a list comprehension of the following:

word_list = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

result = 
for this_word in word_list:
 words_without_this_word = [ other_word for other_word in word_list if other_word != this_word] 
 found = False
 for other_word in words_without_this_word:
 if this_word in other_word:
 found = True

 if not found:
 result.append(this_word)

result

edited Mar 15 at 9:55

Chetan Ameta

6,03822237

answered Mar 15 at 9:42

Christian Sloper

2,776418

1

I believe this will be slightly faster by removing the inner list comprehension, such that it becomes a generator comprehension, like so: result = [i for i in b if not any(i in a for a in b if a != i)]

– beruic
Mar 15 at 12:16

add a comment |

You could use this one liner:

b = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
result = [ i for i in b if not any( [ i in a for a in b if a != i] )]

I admit this is O(n²) and maybe will be slow in performance for large inputs.

This is basically a list comprehension of the following:

word_list = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

result = 
for this_word in word_list:
 words_without_this_word = [ other_word for other_word in word_list if other_word != this_word] 
 found = False
 for other_word in words_without_this_word:
 if this_word in other_word:
 found = True

 if not found:
 result.append(this_word)

result

edited Mar 15 at 9:55

Chetan Ameta

6,03822237

answered Mar 15 at 9:42

Christian Sloper

2,776418

1

I believe this will be slightly faster by removing the inner list comprehension, such that it becomes a generator comprehension, like so: result = [i for i in b if not any(i in a for a in b if a != i)]

– beruic
Mar 15 at 12:16

add a comment |

You could use this one liner:

b = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
result = [ i for i in b if not any( [ i in a for a in b if a != i] )]

I admit this is O(n²) and maybe will be slow in performance for large inputs.

This is basically a list comprehension of the following:

word_list = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

result = 
for this_word in word_list:
 words_without_this_word = [ other_word for other_word in word_list if other_word != this_word] 
 found = False
 for other_word in words_without_this_word:
 if this_word in other_word:
 found = True

 if not found:
 result.append(this_word)

result

edited Mar 15 at 9:55

Chetan Ameta

6,03822237

answered Mar 15 at 9:42

Christian Sloper

2,776418

You could use this one liner:

b = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']
result = [ i for i in b if not any( [ i in a for a in b if a != i] )]

I admit this is O(n²) and maybe will be slow in performance for large inputs.

This is basically a list comprehension of the following:

word_list = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

result = 
for this_word in word_list:
 words_without_this_word = [ other_word for other_word in word_list if other_word != this_word] 
 found = False
 for other_word in words_without_this_word:
 if this_word in other_word:
 found = True

 if not found:
 result.append(this_word)

result

edited Mar 15 at 9:55

Chetan Ameta

6,03822237

answered Mar 15 at 9:42

Christian Sloper

2,776418

edited Mar 15 at 9:55

Chetan Ameta

6,03822237

edited Mar 15 at 9:55

Chetan Ameta

6,03822237

edited Mar 15 at 9:55

Chetan Ameta

6,03822237

answered Mar 15 at 9:42

Christian Sloper

2,776418

answered Mar 15 at 9:42

Christian Sloper

2,776418

answered Mar 15 at 9:42

Christian Sloper

2,776418

1

I believe this will be slightly faster by removing the inner list comprehension, such that it becomes a generator comprehension, like so: result = [i for i in b if not any(i in a for a in b if a != i)]

– beruic
Mar 15 at 12:16

add a comment |

1

I believe this will be slightly faster by removing the inner list comprehension, such that it becomes a generator comprehension, like so: result = [i for i in b if not any(i in a for a in b if a != i)]

– beruic
Mar 15 at 12:16

I believe this will be slightly faster by removing the inner list comprehension, such that it becomes a generator comprehension, like so: result = [i for i in b if not any(i in a for a in b if a != i)]

– beruic
Mar 15 at 12:16

add a comment |

If you have a large list of words, it might be a good idea to use a suffix tree.

Here's a package on PyPI.

Once you created the tree, you can call find_all(word) to get the index of every occurence of word. You simply need to keep the strings which only appear once:

from suffix_trees import STree
# https://pypi.org/project/suffix-trees/
# pip install suffix-trees

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure'] + ['sleep', 'anxiety', 'lack of sleep']
st = STree.STree(words)

st.find_all('blood')
# [0, 20, 26, 46]

st.find_all('high blood pressure')
# [41]

[word for word in words if len(st.find_all(word)) == 1]
# ['high blood pressure', 'anxiety', 'lack of sleep']

words needs to be a unique list of strings, so you might need to call list(set(words)) before generating the suffix-tree.

As far as I can tell, the whole script should run in O(n), with n being the total length of the strings.

edited Mar 15 at 17:14

answered Mar 15 at 17:03

Eric Duminil

41k63472

add a comment |

If you have a large list of words, it might be a good idea to use a suffix tree.

Here's a package on PyPI.

Once you created the tree, you can call find_all(word) to get the index of every occurence of word. You simply need to keep the strings which only appear once:

from suffix_trees import STree
# https://pypi.org/project/suffix-trees/
# pip install suffix-trees

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure'] + ['sleep', 'anxiety', 'lack of sleep']
st = STree.STree(words)

st.find_all('blood')
# [0, 20, 26, 46]

st.find_all('high blood pressure')
# [41]

[word for word in words if len(st.find_all(word)) == 1]
# ['high blood pressure', 'anxiety', 'lack of sleep']

words needs to be a unique list of strings, so you might need to call list(set(words)) before generating the suffix-tree.

As far as I can tell, the whole script should run in O(n), with n being the total length of the strings.

edited Mar 15 at 17:14

answered Mar 15 at 17:03

Eric Duminil

41k63472

add a comment |

If you have a large list of words, it might be a good idea to use a suffix tree.

Here's a package on PyPI.

Once you created the tree, you can call find_all(word) to get the index of every occurence of word. You simply need to keep the strings which only appear once:

from suffix_trees import STree
# https://pypi.org/project/suffix-trees/
# pip install suffix-trees

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure'] + ['sleep', 'anxiety', 'lack of sleep']
st = STree.STree(words)

st.find_all('blood')
# [0, 20, 26, 46]

st.find_all('high blood pressure')
# [41]

[word for word in words if len(st.find_all(word)) == 1]
# ['high blood pressure', 'anxiety', 'lack of sleep']

words needs to be a unique list of strings, so you might need to call list(set(words)) before generating the suffix-tree.

As far as I can tell, the whole script should run in O(n), with n being the total length of the strings.

edited Mar 15 at 17:14

answered Mar 15 at 17:03

Eric Duminil

41k63472

If you have a large list of words, it might be a good idea to use a suffix tree.

Here's a package on PyPI.

Once you created the tree, you can call find_all(word) to get the index of every occurence of word. You simply need to keep the strings which only appear once:

from suffix_trees import STree
# https://pypi.org/project/suffix-trees/
# pip install suffix-trees

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure'] + ['sleep', 'anxiety', 'lack of sleep']
st = STree.STree(words)

st.find_all('blood')
# [0, 20, 26, 46]

st.find_all('high blood pressure')
# [41]

[word for word in words if len(st.find_all(word)) == 1]
# ['high blood pressure', 'anxiety', 'lack of sleep']

words needs to be a unique list of strings, so you might need to call list(set(words)) before generating the suffix-tree.

As far as I can tell, the whole script should run in O(n), with n being the total length of the strings.

edited Mar 15 at 17:14

answered Mar 15 at 17:03

Eric Duminil

41k63472

edited Mar 15 at 17:14

answered Mar 15 at 17:03

Eric Duminil

41k63472

answered Mar 15 at 17:03

Eric Duminil

41k63472

answered Mar 15 at 17:03

Eric Duminil

41k63472

add a comment |

-1

assuming that order of your elements is from shortest string to longest string, you need to check if each element is substring of last one and then remove it from the list:

symptoms = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']


def removeSubstring(data):
 for symptom in data[:-1]:
 if symptom in data[-1]:
 print("Removing: ", symptom)
 data.remove(symptom)
 print(data)


removeSubstring(symptoms)

answered Mar 15 at 10:07

Anna Janiszewska

Thanks, but the way u suggested would be only workable for 1 longest string, simply tried with symptoms = ['blood', 'sleep', 'high blood pressure', 'lack of sleep']

– Lisa
Mar 15 at 10:15

2

It’s normally a real bad idea to remove things from a list while you are iterating over it.

– Christian Sloper
Mar 15 at 10:26

@ChristianSloper can you elaborate why?

– Anna Janiszewska
Mar 15 at 10:32

quora.com/…

– Christian Sloper
Mar 15 at 10:36

add a comment |

-1

assuming that order of your elements is from shortest string to longest string, you need to check if each element is substring of last one and then remove it from the list:

symptoms = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']


def removeSubstring(data):
 for symptom in data[:-1]:
 if symptom in data[-1]:
 print("Removing: ", symptom)
 data.remove(symptom)
 print(data)


removeSubstring(symptoms)

answered Mar 15 at 10:07

Anna Janiszewska

Thanks, but the way u suggested would be only workable for 1 longest string, simply tried with symptoms = ['blood', 'sleep', 'high blood pressure', 'lack of sleep']

– Lisa
Mar 15 at 10:15

2

It’s normally a real bad idea to remove things from a list while you are iterating over it.

– Christian Sloper
Mar 15 at 10:26

@ChristianSloper can you elaborate why?

– Anna Janiszewska
Mar 15 at 10:32

quora.com/…

– Christian Sloper
Mar 15 at 10:36

add a comment |

-1

assuming that order of your elements is from shortest string to longest string, you need to check if each element is substring of last one and then remove it from the list:

symptoms = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']


def removeSubstring(data):
 for symptom in data[:-1]:
 if symptom in data[-1]:
 print("Removing: ", symptom)
 data.remove(symptom)
 print(data)


removeSubstring(symptoms)

answered Mar 15 at 10:07

Anna Janiszewska

assuming that order of your elements is from shortest string to longest string, you need to check if each element is substring of last one and then remove it from the list:

symptoms = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']


def removeSubstring(data):
 for symptom in data[:-1]:
 if symptom in data[-1]:
 print("Removing: ", symptom)
 data.remove(symptom)
 print(data)


removeSubstring(symptoms)

answered Mar 15 at 10:07

Anna Janiszewska

answered Mar 15 at 10:07

Anna Janiszewska

answered Mar 15 at 10:07

Anna Janiszewska

answered Mar 15 at 10:07

Anna Janiszewska

Thanks, but the way u suggested would be only workable for 1 longest string, simply tried with symptoms = ['blood', 'sleep', 'high blood pressure', 'lack of sleep']

– Lisa
Mar 15 at 10:15

2

It’s normally a real bad idea to remove things from a list while you are iterating over it.

– Christian Sloper
Mar 15 at 10:26

@ChristianSloper can you elaborate why?

– Anna Janiszewska
Mar 15 at 10:32

quora.com/…

– Christian Sloper
Mar 15 at 10:36

add a comment |

Thanks, but the way u suggested would be only workable for 1 longest string, simply tried with symptoms = ['blood', 'sleep', 'high blood pressure', 'lack of sleep']

– Lisa
Mar 15 at 10:15

2

It’s normally a real bad idea to remove things from a list while you are iterating over it.

– Christian Sloper
Mar 15 at 10:26

@ChristianSloper can you elaborate why?

– Anna Janiszewska
Mar 15 at 10:32

quora.com/…

– Christian Sloper
Mar 15 at 10:36

Thanks, but the way u suggested would be only workable for 1 longest string, simply tried with symptoms = ['blood', 'sleep', 'high blood pressure', 'lack of sleep']

– Lisa
Mar 15 at 10:15

It’s normally a real bad idea to remove things from a list while you are iterating over it.

– Christian Sloper
Mar 15 at 10:26

@ChristianSloper can you elaborate why?

– Anna Janiszewska
Mar 15 at 10:32

quora.com/…

– Christian Sloper
Mar 15 at 10:36

add a comment |

-1

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

superset_word = ''
#print (words)
for word in words:
 word_list_minus_word = [each for each in words if word != each]
 counter = 0
 for other_word in word_list_minus_word:
 if (other_word not in word):
 break
 else:
 counter += 1
 if (counter == len(word_list_minus_word)):
 superset_word = word
 break
print(superset_word)

answered Mar 15 at 10:09

Venfah Nazir

10914

This does not work on OP's second example

– Christian Sloper
Mar 15 at 10:45

add a comment |

-1

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

superset_word = ''
#print (words)
for word in words:
 word_list_minus_word = [each for each in words if word != each]
 counter = 0
 for other_word in word_list_minus_word:
 if (other_word not in word):
 break
 else:
 counter += 1
 if (counter == len(word_list_minus_word)):
 superset_word = word
 break
print(superset_word)

answered Mar 15 at 10:09

Venfah Nazir

10914

This does not work on OP's second example

– Christian Sloper
Mar 15 at 10:45

add a comment |

-1

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

superset_word = ''
#print (words)
for word in words:
 word_list_minus_word = [each for each in words if word != each]
 counter = 0
 for other_word in word_list_minus_word:
 if (other_word not in word):
 break
 else:
 counter += 1
 if (counter == len(word_list_minus_word)):
 superset_word = word
 break
print(superset_word)

answered Mar 15 at 10:09

Venfah Nazir

10914

words = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

superset_word = ''
#print (words)
for word in words:
 word_list_minus_word = [each for each in words if word != each]
 counter = 0
 for other_word in word_list_minus_word:
 if (other_word not in word):
 break
 else:
 counter += 1
 if (counter == len(word_list_minus_word)):
 superset_word = word
 break
print(superset_word)

answered Mar 15 at 10:09

Venfah Nazir

10914

answered Mar 15 at 10:09

Venfah Nazir

10914

answered Mar 15 at 10:09

Venfah Nazir

10914

answered Mar 15 at 10:09

Venfah Nazir

10914

This does not work on OP's second example

– Christian Sloper
Mar 15 at 10:45

add a comment |

This does not work on OP's second example

– Christian Sloper
Mar 15 at 10:45

This does not work on OP's second example

– Christian Sloper
Mar 15 at 10:45

add a comment |

-3

grams = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

unique_grams = [grams[i] for i in range(len(grams)) if not grams[i] in ' '.join(grams[i+1:])]

edited Mar 15 at 18:40

Vasilis G.

4,0452924

answered Mar 15 at 12:21

Jawad Ali Khan

It doesn't seem to work. For example with grams = ['a b c', 'b c', 'a', 'b', 'c'].

– Eric Duminil
Mar 15 at 17:07

add a comment |

-3

grams = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

unique_grams = [grams[i] for i in range(len(grams)) if not grams[i] in ' '.join(grams[i+1:])]

edited Mar 15 at 18:40

Vasilis G.

4,0452924

answered Mar 15 at 12:21

Jawad Ali Khan

It doesn't seem to work. For example with grams = ['a b c', 'b c', 'a', 'b', 'c'].

– Eric Duminil
Mar 15 at 17:07

add a comment |

-3

grams = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

unique_grams = [grams[i] for i in range(len(grams)) if not grams[i] in ' '.join(grams[i+1:])]

edited Mar 15 at 18:40

Vasilis G.

4,0452924

answered Mar 15 at 12:21

Jawad Ali Khan

grams = ['blood', 'pressure', 'high blood', 'blood pressure', 'high blood pressure']

unique_grams = [grams[i] for i in range(len(grams)) if not grams[i] in ' '.join(grams[i+1:])]

edited Mar 15 at 18:40

Vasilis G.

4,0452924

answered Mar 15 at 12:21

Jawad Ali Khan

edited Mar 15 at 18:40

Vasilis G.

4,0452924

edited Mar 15 at 18:40

Vasilis G.

4,0452924

edited Mar 15 at 18:40

Vasilis G.

4,0452924

answered Mar 15 at 12:21

Jawad Ali Khan

answered Mar 15 at 12:21

Jawad Ali Khan

answered Mar 15 at 12:21

Jawad Ali Khan

It doesn't seem to work. For example with grams = ['a b c', 'b c', 'a', 'b', 'c'].

– Eric Duminil
Mar 15 at 17:07

add a comment |

It doesn't seem to work. For example with grams = ['a b c', 'b c', 'a', 'b', 'c'].

– Eric Duminil
Mar 15 at 17:07

It doesn't seem to work. For example with grams = ['a b c', 'b c', 'a', 'b', 'c'].

– Eric Duminil
Mar 15 at 17:07

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

搜尋此網誌

mjhjmtu