I need a python script to scrape movie and music data from the web. This is for my personal collection. I need title, publish date (or at least month and year), artist and album cover art but other items may be needed. Cover art is essential. Assume the source is imdb for video and amazon for music but always open to other sources that are easier to work with. The list of items must be obtained from the source. The data needs to be written into a database, postgresql, mysql, mongodb etc are all acceptable. Image files can be written to disk with the location stored in the db. I need the script not the completed database.
I need the python code preferably using scrapy or BeautifulSoup so that it can be modified later to acquire information from other sources or other categories. The code must be well commented.
More specifics will be provided on request.
I made a very similar project before: a product database querying bar codes on yoopsie. It was also a python scraper using BeautifulSoup 4. I can add SQLAlchemy as a layer of abstraction to the database so you can plug any DB you want (posgreSQL, MySQL, etc) without having to modify your code. Dont worry, the script would create the database by itself.