Hi, I'm trying to stem words with a Python stemmer. I tried Porter and Lancaster, but they both have the same problem: they can't correctly stem words that end with "er" or "e".
For example, they stem:
computer --> comput
rotate --> rotat
This is part of the code:
line = line.lower()
line = re.sub(r'[^a-z0-9 ]', ' ', line)
line = line.split()
line = [x for x in line if x not in stops]
line = [porter.stem(word, 0, len(word) - 1) for word in line]
# or: line = [st.stem(word) for word in line]
return line
Any idea how to fix this problem?
Is computer -> comput not correct? I might be wrong, but "comput" looks like a stem for "computing", "computed", "computer", "computation". Likewise, "rotat" seems common to "rotate", "rotation", etc. – Dispose
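
To illustrate the point in the comment above, here is a minimal sketch of why a stem only needs to be a shared key, not a dictionary word. Note that `toy_stem`, `TOY_SUFFIXES`, and `group_by_stem` are hypothetical names made up for this example, and the suffix stripper is a crude stand-in, not the Porter or Lancaster algorithm. It only shows how related word forms collapse onto one key.

```python
from collections import defaultdict

# Hypothetical toy suffix list, for illustration only.
# This is NOT the Porter or Lancaster algorithm.
TOY_SUFFIXES = ("ation", "ing", "ed", "er", "e")


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in TOY_SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def group_by_stem(words, stem=toy_stem):
    """Map each stem to the list of words that reduce to it."""
    groups = defaultdict(list)
    for word in words:
        groups[stem(word)].append(word)
    return dict(groups)


words = ["computer", "computing", "computed", "computation",
         "rotate", "rotated", "rotating"]
groups = group_by_stem(words)
for key, members in groups.items():
    print(key, "<-", ", ".join(members))
```

Any callable that maps a word to a string can be passed as `stem`, so a real stemmer's method could presumably be dropped in instead of the toy one. If the output has to be a real word such as "computer" or "rotate", a lemmatizer is probably a better fit than a stemmer.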