The reality is though that many sites currently use Flash to display content that I need to access. Here are some approaches for scraping Flash that I have tried:
- Check for AJAX requests that may carry the data you are after between the flash app and server
- Extract text with the Macromedia Flash Search Engine SDK
- Use OCR to extract the text directly
Most flash apps are self contained and so don't use AJAX, which rules out (1). And I have had poor results with (2) and (3).
Still no silver bullet...
No comments:
Post a Comment