Help: Fill form on new post

eternaloptimist

Well-Known Member
Joined
Jul 10, 2013
Messages
175
Hey all. Need some help here.
I need to write a script that will start on a page like this:



Then get the newest post and fill in the form with some details.



What should I look into, i.e. which libraries, packages, etc.? I'm comfortable with JavaScript/Node, Python and Ruby.
Thanks
 

eternaloptimist

Well-Known Member
Joined
Jul 10, 2013
Messages
175
[)roi(];18033050 said:
For Python try either Mechanize or Scrapy.
For Ruby try Nokogiri in combination with Mechanize.

Just google for code examples; many easy-to-follow tutorials exist.
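
For anyone reading along, here is a rough sketch of the form-filling step that a library like Mechanize automates. It deliberately uses only the Python standard library (`html.parser` and `urllib.parse`) rather than Mechanize itself, so it runs without installing anything. The form markup, the field names (`token`, `name`, `message`) and the helper `build_form_payload` are all made up for illustration; the real page will have its own.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode


class FormFieldParser(HTMLParser):
    """Collect the names and default values of <input> and <textarea> tags."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        name = attrs.get("name")
        if tag in ("input", "textarea") and name:
            # Keep defaults such as hidden tokens; use "" when there is none.
            self.fields[name] = attrs.get("value") or ""


def build_form_payload(html, values):
    """Return URL-encoded form data: the page's defaults overridden by `values`."""
    parser = FormFieldParser()
    parser.feed(html)
    fields = dict(parser.fields)
    fields.update(values)
    return urlencode(fields)


# Hypothetical reply form; real field names will differ.
SAMPLE_FORM = """
<form action="/reply" method="post">
  <input type="hidden" name="token" value="abc123">
  <input type="text" name="name">
  <textarea name="message"></textarea>
</form>
"""

payload = build_form_payload(
    SAMPLE_FORM, {"name": "Sam", "message": "Is this still available?"}
)
print(payload)
```

The resulting string is what would be sent as the body of the POST request to the form's `action` URL. Mechanize and Scrapy wrap this parse, fill and submit cycle for you, plus cookies and redirects.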

Thanks!
Looking at Mechanize right now. I basically need to be the first to respond to a post within my search criteria. Any ideas on monitoring new posts?
 

[)roi(]

Executive Member
Joined
Apr 15, 2005
Messages
6,282
Thanks!
Looking at Mechanize right now. I basically need to be the first to respond to a post within my search criteria. Any ideas on monitoring new posts?
Timing is probably key, and that really comes down to how often new listings appear. There is a frequency risk, though: most sites employ some safeguards against bot searches, e.g. search too often or too many times and you might find your IP blacklisted (banned) for a period.

There is probably a pattern to when the listings appear; for example, are new listings more likely to be added in the PM or the AM? If so, you may only need to scan in those time slots, and probably no more than once every hour.
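
To make the idea concrete, here is a minimal sketch of a polite polling loop, assuming you already have some function that fetches the page and returns the post IDs on it, newest first. The names `poll`, `fetch_ids`, `on_new` and the window format are my own inventions, not part of any library, and the one-hour default interval is just the figure suggested above.

```python
import time


def new_post_ids(seen_ids, current_ids):
    """Return the IDs in `current_ids` that were not seen before, in page order."""
    return [pid for pid in current_ids if pid not in seen_ids]


def in_scan_window(hour, windows):
    """True if `hour` (0-23) falls in any (start, end) window; end is exclusive."""
    return any(start <= hour < end for start, end in windows)


def poll(fetch_ids, on_new, windows, interval_s=3600, max_polls=None,
         sleep=time.sleep, clock=time.localtime):
    """Check for new posts at most once per `interval_s`, only inside `windows`.

    `fetch_ids` is a caller-supplied (hypothetical) function returning the
    post IDs currently on the page; `on_new` is called once per new ID.
    """
    seen = None
    polls = 0
    while max_polls is None or polls < max_polls:
        if in_scan_window(clock().tm_hour, windows):
            current = fetch_ids()
            if seen is None:
                # First fetch only records what is already there.
                seen = set(current)
            else:
                for pid in new_post_ids(seen, current):
                    on_new(pid)
                seen.update(current)
        polls += 1
        sleep(interval_s)
```

For example, `poll(fetch_ids, reply_to_post, windows=[(8, 12), (17, 21)])` would scan hourly during a morning and an evening slot and stay quiet otherwise (`reply_to_post` being whatever you write to fill in the form). The first fetch is treated as a baseline so the script does not respond to posts that were already on the page when it started.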
 

eternaloptimist

Well-Known Member
Joined
Jul 10, 2013
Messages
175
[)roi(];18033346 said:
Timing is probably key, and that really comes down to how often new listings appear. There is a frequency risk, though: most sites employ some safeguards against bot searches, e.g. search too often or too many times and you might find your IP blacklisted (banned) for a period.

There is probably a pattern to when the listings appear; for example, are new listings more likely to be added in the PM or the AM? If so, you may only need to scan in those time slots, and probably no more than once every hour.

Makes so much sense now, thanks! I think I'm gonna have to take a risk.
 