How To Build A Web Crawler - Trouble crawling table

+3 Thien Nguyen · January 27, 2016
/images/forum/upload/2016-01-27/e288e1d31584039ff179ec4abc28ae76.png

I'm trying to crawl the description belonging to ISBN13. But I can't seem to get it right. How would one even start to crawl this?

Post a Reply

Replies

Oldest  Newest  Rating
+1 Thien Nguyen · February 3, 2016
I fixed it with
data = []
table = soup.find('table', {'class': 'boxedbo
for row in table.findAll('tr'):
cols = row.findAll('td')
cols = [ele.text.strip() for ele in cols]
data.append([ele for ele in cols if ele])

Thanks for the help :)
0 sfolje 0 · February 4, 2016
nice!
0 sfolje 0 · January 27, 2016
How did you tried?
Also, It would be useful to post url of this page, to check it out.

On the first sight, I would crawl "specs_title", e.g. beautifulSoup( 'td' {class = specs_title'} ) , if you know what i am talking about.
  • 1

Python

107,143 followers
About

This section is all about snakes! Just kidding.

Links
Moderators
Bucky Roberts Administrator