linguistics Questions
7
Solved
Does anyone have a suggestion for where to find archives or collections of everyday English text for use in a small corpus? I have been using Gutenberg Project books for a working prototype, and wo...
Vandyke asked 26/9, 2008 at 2:15
5
Was wondering if anyone had any tips or could point me in the right direction to finding/creating some sort of algorithm to find rhyming words.
I specifically do not want to use an API, as ...
Arrival asked 19/12, 2012 at 21:13
3
Some background
I am a literature student at New College of Florida, currently working on an overly ambitious creative project. The project is geared towards the algorithmic generation of poetry. ...
Rupiah asked 12/1, 2012 at 21:44
2
Solved
The input is either:
(1) a bracketed representation of a tree with labeled internal nodes such as:
(S (N John) (VP (V hit) (NP (D the) (N ball))))
with output:
(Whether the lines are dashed ...
Euthenics asked 5/3, 2012 at 2:33
2
Solved
I'm a big fan of discovering sentences that can be rapped very quickly. For example, "gotta read a little bit of Wikipedia" or "don't wanna wind up in the gutter with a bottle of malt." (George Wat...
Whalen asked 27/2, 2012 at 3:38
1
Solved
The app I'm currently working on requires me to determine the part of speech of a word in NSString.
So basically is there a library/database/class which you can access in Objective C which allows ...
Vizor asked 6/2, 2012 at 21:12
1
I am experimenting with word sense disambiguation using wordnet for my project. As a part of the project, I would like to convert a derived adjective or an adverb form to it's root noun form.
For ...
Pronty asked 25/9, 2011 at 20:44
2
Solved
I've been playing with Haskell for about a month. For my first "real" Haskell project I'm writing a parts-of-speech tagger. As part of this project I have a type called Tag that represents a parts-...
Jailer asked 15/9, 2011 at 23:26
1
Is there any free database/place out there with commonality/usage ratios of English words? (British or U.S. English, doesn't matter)
I don't care about the exact numbers, only relative to eachothe...
Forelimb asked 30/8, 2011 at 19:40
3
Solved
TL;DR Summary: I need a single command-line application which I can use to get synonyms and other related words. It needs to be multi-lingual and works cross platform. Can anyone suggest a suitable...
Capitalize asked 28/4, 2011 at 11:9
1
Is there a public API which would let me lookup definitions for words? I've been searching for this for a bit but it's getting mixed up with the dictionary datastructure. I'm planning on usin...
Spiritualty asked 1/3, 2011 at 21:31
1
Solved
I'm learning NLP. I currently playing with Word Sense Disambiguation. I'm planning to use the semcor corpus as training data but I have trouble understanding the xml structure. I tried googling but...
Alike asked 3/1, 2011 at 10:27
2
Solved
I am using both Daitch-Mokotoff soundexing and Damerau-Levenshtein to find out if a user entry and a value in the application are "the same".
Is Levenshtein distance supposed to be used as an abso...
Candlestand asked 6/10, 2010 at 19:46
4
I wondered how you would go about tokenizing strings in English (or other western languages) if whitespaces were removed?
The inspiration for the question is the Sheep Man character in the Murakam...
Templia asked 3/10, 2010 at 21:43
1
Solved
In my news page project, I have a database table news with the following structure:
- id: [integer] unique number identifying the news entry, e.g.: *1983*
- title: [string] title of the text, e....
Horripilate asked 26/8, 2010 at 13:40
13
Solved
I have an algorithm that generates strings based on a list of input words. How do I separate only the strings that sounds like English words? ie. discard RDLO while keeping LORD.
EDIT: To clarify,...
Dulcedulcea asked 18/9, 2008 at 12:20
4
Solved
An interlinear gloss can be used to layout a translation of a document.
http://en.wikipedia.org/wiki/Interlinear_gloss
Usually this is done word-by-word or morpheme-by-morpheme. However, I ...
Illuminometer asked 25/5, 2010 at 18:19
2
Some languages, particularly Slavic languages, change the endings of people's names according to the grammatical context. (For those of you who know grammar or studied languages that do this to wor...
Eardrop asked 19/5, 2010 at 17:27
4
Solved
I am looking for word alignment tools and algorithms.
I am dealing with bilingual English - Hindi text, and currently working on
DTW (Dynamic Time Warping) algorithm
CLA (Competitive Linking Alg...
Gressorial asked 11/3, 2010 at 14:18
2
Solved
Let's say there is a sentence:
On March 1, he was born.
Changing it to
He was born on March 1.
doesn't break the sense of the sentence and it is still valid. Shuffling words in any other way...
Ethelstan asked 23/4, 2010 at 15:9
6
Solved
Let's say you should monitor the brand "ONE" online. What algorithms can be used to separate pages about the brand ONE from pages containing the common word ONE?
I'm thinking maybe Bayes could wor...
Washington asked 15/2, 2010 at 12:20
7
I'm making a boggle-like word game. The user is given a grid of letters like this:
O V Z W X
S T A C K
Y R F L Q
The user picks out a word using any adjacent chains of letters, like the word "ST...
Rabat asked 15/2, 2010 at 18:36
5
Solved
What are books about how to build a natural language parsing program like this:
input: I got to TALL you
output: I got to TELL you
input: Big RAT box
output: Big RED box
in: hoo un thum zend th...
Freezer asked 10/2, 2010 at 12:53
2
Solved
My goal is to analyze some corpus (twitter for the now) for emotional content. Just today I realized it would make a bit of sense to search for word stems as opposed to having an exhaustive list of...
Housum asked 12/8, 2009 at 8:2
9
I am writing a script to reverse all genders in a piece of text, so all gendered words are swapped - "man" is swapped with "woman", "she" is swapped with "he", etc. But there is an ambiguity as to ...
Contactor asked 7/11, 2009 at 14:41
© 2022 - 2024 — McMap. All rights reserved.