Building a scalable distributed web crawler which can perform both crawling and data extraction
Apr 18, 2015 (5 min read)
Tags: Web-Crawler, Scala, Akka, Kafka, Couchbase, Jsoup, Big-Data, Design, Distributed, Proxy