Your Full Name |
Please enter your full name (first and last). |
Email |
Your email address. It identifies your account, and Harvester will send alert notices to this address. |
Phone |
Your phone number. We will probably use it only if we need to discuss an error in your submission or if your email isn't working. |
State of Election District |
This is the state of the district for which the data harvesting request is submitted, not necessarily your own state. |
County or District Name |
Normally, this is the county election district, but it varies by state. If you are monitoring statewide results, use "Statewide". |
Prefix |
Choose a prefix for the data files that will make sense, with no spaces (underscores are okay). So that results for a state are grouped together, use the state code as the first two characters, then the district, and then the type of file, for example "CA_SanDiego_Results". Remember what you entered here so you can update any prior requests. The prefix should be unique among all the requests you submit. |
Enable Task Item |
Normally Enabled. To disable an entry, submit the same entry again (same prefix) and choose Disable; none of the other fields in that submission will be considered or updated. |
Starting URL |
If "Extraction Type" is "Direct", then this is the full URL of the item to be archived. If "Extraction Type" is "Indirect", then this is the full (unchanging) URL of a web page on which hyperlinked text links to the (changing) URL of the item to be archived. |
Extraction Type |
If the object to be archived is a file at a single URL that is dynamically modified, choose "Direct". "Indirect" means that Harvester will first access an intervening web page to find the (changing) link to the data item, which is the link behind the hyperlinked "Link Text". |
Link Text |
Used only with Indirect extraction. This is the hyperlinked text on the first page that links to the data item; the link behind it is updated with each revision of the object. |
Allowed File Extension(s) |
Used only with Indirect extraction. Provide the file extension of the data item, like "PDF", "DOC", "XML", "XSL", "CSV", etc. (case insensitive). This excludes spurious matches from the initial page. You can allow multiple types by specifying multiple extensions separated by commas. |
Default Object File Extension |
Provide the file extension to use for the work file and archive file of the data item so they can be easily opened. The default is 'htm'. This extension replaces active extensions like 'aspx'. |
Scan Interval |
This is how often the object will be checked. The default is 15 minutes. Set this to the longest interval that is reasonably possible to avoid harvesting too many intermediate revisions. |
No Update Alert Interval |
If an update is not seen within this interval, Harvester will send an email alert. The actual alert time is set 20% longer than the interval shown to allow some slack for updates delayed by various factors (holidays, etc.). |
Archive Each Version / Alert Only |
In some cases, it may be sufficient to get an alert when the object changes, and then you can look at it yourself by accessing the URL. However, if the object changes dynamically, as is the case with election results, it is best to archive each new version to provide a history of changes to the object. |
Enable Alerts |
If you want to disable email alerts, set this to Disable. Enabled by default. |
Comment |
Provide any additional comments here, for example if you find something unusual, have trouble finding the links, or need different intervals than those shown. |
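
As an illustration only, the sketch below shows how the Indirect extraction fields (Starting URL, Link Text, Allowed File Extension(s)) and the No Update Alert Interval described above might fit together. It is not Harvester's actual code: the names `resolve_indirect` and `alert_deadline`, the exact-match comparison of Link Text, and the example.org URLs are all assumptions made for this example.

```python
from datetime import datetime, timedelta
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
import posixpath


class _LinkCollector(HTMLParser):
    """Collect (href, visible text) pairs for every <a href=...> element."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def resolve_indirect(page_html, starting_url, link_text, allowed_extensions):
    """Hypothetical helper: find the changing data-item URL on the starting page.

    Returns the absolute URL of the first link whose text equals link_text
    (exact match is an assumption) and whose extension is in the
    comma-separated, case-insensitive allowed_extensions list, else None.
    """
    allowed = {e.strip().lower().lstrip(".") for e in allowed_extensions.split(",")}
    collector = _LinkCollector()
    collector.feed(page_html)
    for href, text in collector.links:
        ext = posixpath.splitext(urlparse(href).path)[1].lstrip(".").lower()
        if text == link_text and ext in allowed:
            return urljoin(starting_url, href)
    return None


def alert_deadline(last_update, no_update_interval):
    """Hypothetical helper: the alert fires 20% later than the interval shown."""
    return last_update + no_update_interval + no_update_interval / 5
```

For example, with a starting page at `https://example.org/elections/` that contains two links labeled "Results", one to `/about.html` and one to `files/results_v7.PDF`, calling `resolve_indirect(page, "https://example.org/elections/", "Results", "pdf, csv")` skips the first link because its extension is not allowed and returns the second as an absolute URL. With a 24-hour No Update Alert Interval, `alert_deadline` places the alert 28 hours and 48 minutes after the last update seen.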