I want to scrape a job website, and I want to do some testing in the Scrapy shell. If I run:
scrapy shell http://www.seek.com.au
and then type:
from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
it works fine. But if I run:
scrapy shell http://www.seek.com.au/JobSearch?DateRange=31&SearchFrom=quick&Keywords=python&nation=3000
and then type:
from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
then bash complains that from is not a valid command, the Scrapy shell exits, and the job shows up on screen as stopped:
>>> from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
-bash: from: command not found
[5]+ Stopped scrapy shell http://www.seek.com.au/JobSearch?DateRange=31
[7] Done Keywords=php
Apparently, you need to enclose your URL in double quotes; otherwise bash treats each & as a command separator and sends scrapy shell to the background:
scrapy shell "http://www.seek.com.au/JobSearch?DateRange=31&SearchFrom=quick&Keywords=python&nation=3000"
>>> from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
>>> lx = SgmlLinkExtractor()
Then everything works smoothly (the above is my actual shell output).
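If you want to sanity-check that the extractor actually sees the links on the fetched page, you can run it against the response object that scrapy shell pre-populates for you. This is just a quick sketch continuing the session above, not output from the original session:
>>> links = lx.extract_links(response)  # response is provided by scrapy shell
>>> len(links)
>>> [link.url for link in links[:5]]    # inspect the first few extracted URLs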
I tried it without the double quotes and it doesn't work: the fetch keeps running in the background, and the first key press drops me back to bash without changing the visual output, which gives the same error you have.
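As a side note for anyone on Scrapy 1.0 or later: the scrapy.contrib package has been deprecated, so the equivalent import today is the built-in LinkExtractor (the same quoting rule for the URL still applies):
>>> from scrapy.linkextractors import LinkExtractor
>>> lx = LinkExtractor()
>>> lx.extract_links(response)  # same usage as SgmlLinkExtractor above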