I made a DocumentTermMatrix (dtm) in R using the tm package. If I understand correctly, this matrix shows, for each document, how often each term occurs. When I inspect the matrix, I get:
        Terms
Docs     can design door easy finish include light provide use water
  176004   1      2   11    8      0       3     3       4   4     4
  181288   1      2   11    8      0       2     3       4   4     4
  182465   4      4    0    2      0       0    42      13   6     0
etc.
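For reference, here is a minimal sketch of the kind of setup that produces such a matrix. The toy texts and the variable names `texts` and `corpus` are made up for illustration and are not my actual data; only the tm calls themselves (`VCorpus`, `VectorSource`, `DocumentTermMatrix`, `inspect`) are the real ones.

```r
library(tm)

# Toy texts, invented purely for illustration (not the real corpus)
texts <- c("the door design is easy to use",
           "water and light finish for the door",
           "easy design, easy use")

# Build a corpus from the character vector, then the document-term matrix
corpus <- VCorpus(VectorSource(texts))
dtm <- DocumentTermMatrix(corpus)

# Prints the dimensions, the sparsity, and a sample of the term counts
inspect(dtm)
```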
How can I now retrieve the vector for a single document, for example document 181288? I would like to get something like:
1 2 11 8 0 2 3 4 4 4 ………
Also, it says my dtm's sparsity is 100%. Does that mean the matrix is (approximately) 100% empty?