I was wondering if there is any chance of R's text mining package having the following feature: <pre class="prettyprint"><code>myCorpus <- Corpus(DirSource(<directory-contatining-textfiles>),control=...) # add docs myCorpus.addDocs(DirSource(<new-dir>),control=...) </code></pre> Ideally I would like to incorporate additional documents into the existing corpus. Any help is appreciated

You should be able just to use <code>c(,)</code> as in <pre class="prettyprint"><code>> library(tm) > data("acq") > data("crude") > together <- c(acq,crude) > acq A corpus with 50 text documents > crude A corpus with 20 text documents > together A corpus with 70 text documents </code></pre> You can find more in the tm package documentation under <code>tm_combine</code>.

R text mining package: Allowing to incorporate new documents into an existing corpus

Tags:

text

r

text-mining

I was wondering if there is any chance of R's text mining package having the following feature:

myCorpus <- Corpus(DirSource(<directory-contatining-textfiles>),control=...)
# add docs
myCorpus.addDocs(DirSource(<new-dir>),control=...)

Ideally I would like to incorporate additional documents into the existing corpus.

Any help is appreciated

741

asked Jul 07 '11 20:07

Shivani Rao

1 Answers

You should be able just to use c(,) as in

> library(tm)
> data("acq")
> data("crude")
> together <- c(acq,crude)
> acq
A corpus with 50 text documents
> crude
A corpus with 20 text documents
> together
A corpus with 70 text documents

You can find more in the tm package documentation under tm_combine.

115

answered Oct 06 '22 04:10

Henry

Related questions
                            
                                Subset a table by columns and rows using a named vector in R
                            
                                How to unread hide excel sheet in R(read_excel)?
                            
                                Plotting decision tree results from tidymodels
                            
                                TidyText Clustering
                            
                                Grouped recurrence by periods over a data.table
                            
                                Cumulative vector in data table
                            
                                Rstudio pipe operator (%>%) shortcut (Ctrl+Shift+M) not working
                            
                                R: Extracting Rules from a Decision Tree
                            
                                Generate stochastic random deviates from a density object with R
                            
                                R : catching errors in `nls`
                            
                                Extracting Nouns and Verbs from Text
                            
                                How to write a c() function for custom S3 class in R
                            
                                Setting an xts Index
                            
                                Optimal method of comparing a vector of numbers to values in another vector
                            
                                Fill lower matrix with vector by row, not column
                            
                                Axis Color of Date Histogram in R
                            
                                Split a data frame into overlapping dataframes
                            
                                Comparing Kernel Density Estimation plots
                            
                                Adding floating point precision to qnorm/pnorm?
                            
                                How to silence the output from this R package?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With