What to do?
- Crawl a whole website using python programming. I use sublime.text
The website: http://www.police.gov.bn/Lists/News/AllItems.aspx
- Overall, I need it also for the CSV file (excel)
Title
Date
Url
The whole content of the title.
(The website above only shows partial content. Have to click on the title again for whole content
but i want it to automatically all content being crawl and being shown in CSV)
For example:
Title : KAWAL KESELAMATAN KAMPUNG MELALUI PENGAWASAN KEJIRANAN (this is in Malay Language, you can view on the latest website as it is the same)
Date: Jumaat, 13 Sept
Url: http://www.police.gov.bn/Lists/News/ItemDisplayForm.aspx?ID=1581&ContentTypeId=0x01004EE1520DB09AC547981D92B83F9B1E48 (Have to click on the title for whole story)
The whole content: Start from 'Keutuhan hubungan bermasyarakat sesama berjiran mewujudkan... (First paragraph)
End until ' .... di kampung mereka bebas dari jenayah. " (Last paragraph)
- This crawl should be keep on updating like it retrieve latest news (loop)
- After above done, if there's a way for Translator API.
To translate from Malay Language to English Language.
Since this website is in Malay.
Get Free Quote!
356 Experts Online