I am unable to do the following:
from scrapy.selector import Selector
The error is:
File "/Desktop/KSL/KSL/spiders/spider.py", line 1, in from scrapy.selector import Selector ImportError: cannot import name Selector
It is as if lxml is not installed on my machine, but it is. Also, I thought Selector was a default module built into Scrapy. Maybe not?
Thoughts?
When you use text nodes in an XPath string function, use . (dot) instead of .//text(), because .//text() produces a collection of text elements called a node-set, and a node-set converted to a string yields only the text of its first element.
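A minimal sketch of the difference, using made-up markup (any link whose text is split across child elements behaves this way):

from scrapy.selector import Selector

sel = Selector(text='<a href="#">Click here to go to the <strong>Next Page</strong></a>')
# .//text() yields a node-set; converted to a string it becomes only the
# first text node, which does not contain "Next Page", so nothing matches:
sel.xpath("//a[contains(.//text(), 'Next Page')]").extract()  # empty list
# . is converted to the element's full string-value, so the link matches:
sel.xpath("//a[contains(., 'Next Page')]").extract()  # the whole <a> element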
Alternatively, you can use response.css() to select all a elements with the class title, ::attr(href) to select the href attribute of each matched element, and getall() to return every href value as a list.
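A short sketch of that chain against a made-up page (getall() only exists in newer Scrapy releases; on older ones use extract() instead):

from scrapy.http import HtmlResponse

# Hypothetical markup with two links of class "title":
body = '<a class="title" href="/post/1">First</a><a class="title" href="/post/2">Second</a>'
response = HtmlResponse(url='http://example.com', body=body, encoding='utf-8')
# Select the href attribute of every <a class="title"> element
print(response.css('a.title::attr(href)').getall())  # ['/post/1', '/post/2']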
Try importing HtmlXPathSelector instead.
from scrapy.selector import HtmlXPathSelector
Then use the .select() method to extract data from the HTML. For example,
sel = HtmlXPathSelector(response)
site_names = sel.select('//ul/li')
If you are following the tutorial on the Scrapy site (http://doc.scrapy.org/en/latest/intro/tutorial.html), the updated example would look like this:
from scrapy.spider import BaseSpider
from scrapy.selector import HtmlXPathSelector

class DmozSpider(BaseSpider):
    name = "dmoz"
    allowed_domains = ["dmoz.org"]
    start_urls = [
        "http://www.dmoz.org/Computers/Programming/Languages/Python/Books/",
        "http://www.dmoz.org/Computers/Programming/Languages/Python/Resources/"
    ]

    def parse(self, response):
        sel = HtmlXPathSelector(response)
        sites = sel.select('//ul/li')
        for site in sites:
            title = site.select('a/text()').extract()
            link = site.select('a/@href').extract()
            desc = site.select('text()').extract()
            print title, link, desc
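You can then run the spider from the project directory with scrapy crawl dmoz.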
Hope this helps!
I encountered the same problem. I think there is something wrong with your Scrapy version.
You can run scrapy version -v at the command line to check the installed version. As far as I know, the newest release is 0.24.4 (2014-10-23); you can visit http://scrapy.org/ to find the latest.
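If you would rather check from Python, here is a quick sketch; Selector was introduced around Scrapy 0.20 (an assumption worth verifying against the release notes), and older releases only ship the legacy selector classes:

import scrapy

print(scrapy.__version__)  # importing Selector needs a reasonably recent release

try:
    # Newer Scrapy (roughly 0.20+, as assumed above)
    from scrapy.selector import Selector
except ImportError:
    # Older releases only provide the legacy selector
    from scrapy.selector import HtmlXPathSelector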