I want to scrape information from a search I did on a career site.
Here is the result of the search:
[login to view URL]
For each result on each page, I need the following from the main record:
Occupation
License Name
Licensing Agency
State
Each of those results also links to a page, from which I need:
URL
2008 Total Number of Licenses (Annual): # (varies)
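A minimal sketch of how the license count could be pulled out of a detail page's text, assuming the label appears in the page exactly as quoted above. The function name and the sample value in the usage line are hypothetical, not taken from the site.

```python
import re

# Assumption: the label text on the detail page matches the wording in this
# posting, with a four-digit year in front and a number (possibly with
# thousands separators) after the colon.
LICENSE_COUNT_RE = re.compile(
    r"(\d{4}) Total Number of Licenses \(Annual\):\s*(\d[\d,]*)"
)


def parse_license_count(page_text):
    """Return (year, count) from the page text, or None if the label is absent."""
    match = LICENSE_COUNT_RE.search(page_text)
    if match is None:
        return None
    year = int(match.group(1))
    # Drop thousands separators before converting to an integer.
    count = int(match.group(2).replace(",", ""))
    return year, count


# Hypothetical sample text; the real figure varies per record.
sample = "2008 Total Number of Licenses (Annual): 1,250"
result = parse_license_count(sample)
```

Returning None for a missing label lets the caller leave that field blank rather than fail on records that do not report a count.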
Ideally you can scrape from this site too, though I can't get it all onto one 'page':
[login to view URL]
Each of the 20 or so headings expands into several subheadings.
I need to pull those heading names and then attach information from each link, which leads to one or more pages of results. From those pages I need:
Certification Name
Certification Type (they are broken out by Common/Advanced/Skill)
Certifying Organization
The Hyperlink from the linked page
The end of the url address which is '&soccode=******'
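The URL suffix in the last item could be extracted roughly as follows. This is a sketch that assumes the parameter is always named soccode; the function name and the example.com address are made up for illustration.

```python
from urllib.parse import parse_qs, urlsplit


def soccode_suffix(url):
    """Return the '&soccode=...' suffix of a URL, or None if it has no soccode."""
    # parse_qs maps each query parameter name to a list of its values.
    query = parse_qs(urlsplit(url).query)
    values = query.get("soccode")
    if not values:
        return None
    return "&soccode=" + values[0]


# Hypothetical URL; only the soccode value comes from the example in this posting.
suffix = soccode_suffix("https://example.com/certs?keyword=x&soccode=173021")
```

Parsing the query string instead of slicing the end of the address keeps this working even if soccode is not the last parameter.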
Example:
From Main Page
[login to view URL]
The first link is Architecture and Engineering, which expands; its first link is Aerospace Engineering and Operations Technicians.
That opens a page of results (actually 2 pages in this case) with a number of certifications, starting under the heading Common, the first of which is
Quality Technician Certification
So the first record would be
Occupation: Aerospace Engineering and Operations Technicians
Certification Type: Common
Certification: Quality Technician Certification
Organization: American Society for Quality
URL: [login to view URL]
URL_Suffix: &soccode=173021
All of the links on that result would have the same Occupation and URL_Suffix, and all of those under the same heading (Common, Advanced or Skill) would have the same Certification Type.
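One way the records could be assembled and written out, as a rough sketch. It assumes the scraped certifications for an occupation have already been grouped by heading; the function names, the dictionary keys and the CSV output format are assumptions, and the URL is left as the placeholder from this posting.

```python
import csv
import io

# Column names follow the example record in this posting.
FIELDS = ["Occupation", "Certification Type", "Certification",
          "Organization", "URL", "URL_Suffix"]


def build_records(occupation, url_suffix, certs_by_type):
    """Flatten one occupation's certifications into one record per certification."""
    records = []
    for cert_type, certs in certs_by_type.items():
        for cert in certs:
            records.append({
                "Occupation": occupation,        # shared by every record
                "Certification Type": cert_type,  # heading the link sits under
                "Certification": cert["name"],
                "Organization": cert["organization"],
                "URL": cert["url"],
                "URL_Suffix": url_suffix,        # shared by every record
            })
    return records


def to_csv(records):
    """Render the records as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# The first record from the example in this posting.
records = build_records(
    "Aerospace Engineering and Operations Technicians",
    "&soccode=173021",
    {"Common": [{"name": "Quality Technician Certification",
                 "organization": "American Society for Quality",
                 "url": "[login to view URL]"}]},
)
csv_text = to_csv(records)
```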
Then you'd pull the same for the second link on the main page, 'Aerospace Engineers', and so on.
I think this is almost as easy as the first task, as the results are identically formatted and easy to pull.