Skip to main content

All Questions

Tagged with
Filter by
Sorted by
Tagged with
0 votes
1 answer
124 views

Identifying synonymous rows of a text column in a dataframe using R

Suppose ABC is a dataframe as given below: ABC <- data.frame(Column1 = c(1.222, 3.445, 5.621, 8.501, 9.302), Column2 = c(654231, 12347, -2365, 90000, 12897), ...
Ray's user avatar
  • 351
0 votes
1 answer
182 views

Interpretation question: Textstat_similarity Quanteda

I have a dataset of 310,225 tweets. I want to find out how many tweets were same or similar. I calculated the similarity between the tweets using Quanteda's textstat frequency. I found that the ...
Shreya Agarwal's user avatar
1 vote
0 answers
224 views

(solved by updating) Measure similarity of sentences to target document's sentences (error msg)

I am using the R quanteda package (version 1.4.3, windows system) to find almost verbatim text-reuse in several documents. In the end, I want to compare the relative influence of the source documents, ...
awagner's user avatar
  • 117
0 votes
1 answer
279 views

R extracting job titles from list of sentences

I am currently writing a script (In R) to extract job titles from sentences which are from biographies of various corporate executives. I have a list of job titles and a list of sentences from the ...
Dyem's user avatar
  • 9
1 vote
0 answers
793 views

Quantifiying the similarity between two sentences

For a project I want to quantify the similarity between two strings of text. The strings are titles of research proposals, and within researchers we want to calculate the similarity between pairs of ...
Tom Bailey's user avatar
0 votes
1 answer
70 views

R: Row numbers unmatched for Sentence to word-table

From my previous problem, I have some texts in different rows, and from the text I am trying to generating word-table for each of the words. But problem is occurring when the row number of the text ...
bim's user avatar
  • 789