 |
| acquia | | Web Crawler Intern | | | |
We're looking for an intern who wants to take the lead on an a site web crawler Java project that looks over millions of websites online to find those powered by the Drupal content management system. Click here to learn about the history of this crawler project. We're looking for someone with passion, integrity and intelligence that can take full ownership of issues and be responsible for the majority of the crawler. Apply here.
Essential skills
* Expert in developing, maintaining, and troubleshooting Java applications on Linux.
* Experience with web standards and protocols such as HTTP, HTML, robots.txt, etc.
* Strong interest for multi-server distributed systems and large-scale databases.
* Exceptional verbal and written communication skills to work with remote employees.
* An ability to work both independently and collaboratively.
Desired skills
* Experience with Apache Hadoop, Apache HDFS, etc.
* Experience with Amazon EC2 or other cloud computing.
* Experience with test-driven development.
* Experience with using Subversion.
* Experience with MySQL.
* Experience taking over an existing project and extending it.
Job responsibilities
* Take full ownership of issues and be responsible for the majority of the crawler.
* Work with a team of Build, QA and Engineering and the CTO as needed to identify requirements, develop solutions, report and debug defects, find fixes or workarounds, and validate enhancement requests, document architecture and comment code.
* Work with the operations team to keep the crawler running.
| | | |
| | Apply Now | | |
|
|