Skip to content

Recherche incluant les m² vides #4

@Imtiaz88

Description

@Imtiaz88

Bonjour.

Merci pour votre outil.
J'essaie d'inclure les ventes qui ne mentionnent pas les mètres carrés, cependant celles ci n'apparaissent pas dans mon fichier final.
Cela provient a priori d'ici :
'spider_exceptions/TypeError': 10
TypeError: expected string or bytes-like object
Je n'arrive pas à modifier le code pour inclure ces ventes qui ne mentionnent pas le m².
Sauriez vous comment faire.
Merci

Voici un extrait des résultats :

2020-02-22 19:20:11 [scrapy.core.scraper] ERROR: Spider error processing <GET https://www.leboncoin.fr/ventes_immobilieres/1730717886.htm/> (referer: https://www.leboncoin.fr/recherche/?category=9&locations=Marseille&price=80000-99999&real_estate_type=2%2C1%2C3%2C4%2C5&immo_sell_type=new%2Cold&page=12)
Traceback (most recent call last):
File "c:\users*\miniconda3\lib\site-packages\scrapy\utils\defer.py", line 102, in iter_errback
yield next(it)
File "c:\users*
\miniconda3\lib\site-packages\scrapy\core\spidermw.py", line 84, in evaluate_iterable
for r in iterable:
File "c:\users*\miniconda3\lib\site-packages\scrapy\spidermiddlewares\offsite.py", line 29, in process_spider_output
for x in result:
File "c:\users*
\miniconda3\lib\site-packages\scrapy\core\spidermw.py", line 84, in evaluate_iterable
for r in iterable:
File "c:\users*\miniconda3\lib\site-packages\scrapy\spidermiddlewares\referer.py", line 339, in
return (_set_referer(r) for r in result or ())
File "c:\users*
\miniconda3\lib\site-packages\scrapy\core\spidermw.py", line 84, in evaluate_iterable
for r in iterable:
File "c:\users*\miniconda3\lib\site-packages\scrapy\spidermiddlewares\urllength.py", line 37, in
return (r for r in result or () if _filter(r))
File "c:\users*
\miniconda3\lib\site-packages\scrapy\core\spidermw.py", line 84, in evaluate_iterable
for r in iterable:
File "c:\users*\miniconda3\lib\site-packages\scrapy\spidermiddlewares\depth.py", line 58, in
return (r for r in result or () if _filter(r))
File "c:\Users*
\Downloads\projetappart-master\projetappart\spiders\appart.py", line 90, in parse_page
surface_bien = parse_surface_bien(response)
File "c:\Users*\Downloads\projetappart-master\projetappart\utils\utils.py", line 85, in parse_surface_bien
surface = re.findall(r'\d+', suface_str)
File "c:\users*
\miniconda3\lib\re.py", line 223, in findall
return _compile(pattern, flags).findall(string)
TypeError: expected string or bytes-like object
(...)
2020-02-22 19:20:12 [scrapy.core.engine] INFO: Closing spider (finished)
2020-02-22 19:20:12 [scrapy.extensions.feedexport] INFO: Stored csv feed (402 items) in: result_MRS_ALL_22.02.2020.19.16.csv
2020-02-22 19:20:12 [scrapy.statscollectors] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 258850,
'downloader/request_count': 425,
'downloader/request_method_count/GET': 425,
'downloader/response_bytes': 15495012,
'downloader/response_count': 425,
'downloader/response_status_count/200': 425,
'dupefilter/filtered': 1,
'elapsed_time_seconds': 11.678637,
'finish_reason': 'finished',
'finish_time': datetime.datetime(2020, 2, 22, 18, 20, 12, 753726),
'item_scraped_count': 402,
'log_count/DEBUG': 828,
'log_count/ERROR': 10,
'log_count/INFO': 11,
'request_depth_max': 12,
'response_received_count': 425,
'robotstxt/request_count': 1,
'robotstxt/response_count': 1,
'robotstxt/response_status_count/200': 1,
'scheduler/dequeued': 424,
'scheduler/dequeued/memory': 424,
'scheduler/enqueued': 424,
'scheduler/enqueued/memory': 424,
'spider_exceptions/TypeError': 10,
'start_time': datetime.datetime(2020, 2, 22, 18, 20, 1, 75089)}
2020-02-22 19:20:12 [scrapy.core.engine] INFO: Spider closed (finished)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions