📝 Distributed forward proxy servers to crawl Internet via multiple cloud instances. 📅 Jul 20, 2015 ⏱️ 3 min read Squid Haproxy Proxy Http Setup Distributed
📝 Building a scalable distributed web crawler which can perform both crawling and data extraction 📅 Apr 18, 2015 ⏱️ 5 min read Web-Crawler Scala Akka Kafka Couchbase Jsoup Big-Data Design Distributed Proxy