Document to Database System Requirements

Architecture:

The server side consists of three components: the crawler, the database and the web server. For optimal performance, these components should run on separate machines. However, if a simpler architecture is desired for evaluation purposes and only limited usage is expected, they can be combined on one or two machines with appropriate resources.

Software requirements, crawling server

  • OS: Linux preferred, Windows also OK

  • Java: Oracle JRE 1.6+

Software requirements, database server

  • Oracle 11g for the database with JChem Oracle Cartridge installed

Software requirements, web server

  • OS: Linux preferred, Windows also OK

  • Any Java servlet container, typically Tomcat 6+

Software requirements, client

  • Web browser: a recent version of Mozilla Firefox (tested on 17 and above), Google Chrome, or Internet Explorer 9 and above

Hardware requirements, crawling server

  • Depends mainly on the number and size of documents crawled initially, and of those added or modified in each crawling period

  • At least 4 cores are recommended

  • 4-8 GB RAM or more

  • 500 MB disk space for the application
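
As a rough illustration, the recommended minimums above can be checked on a candidate crawling host with a short script. This is only a sketch: the function name and threshold constants are hypothetical, the RAM threshold uses the lower bound of the 4-8 GB range, and the RAM value has to be supplied by hand because the Python standard library has no portable way to query it.

```python
import os
import shutil

# Recommended minimums taken from the list above
# (RAM uses the lower bound of the 4-8 GB range).
MIN_CORES = 4
MIN_RAM_GB = 4
MIN_FREE_DISK_MB = 500


def crawler_shortfalls(cores, ram_gb, free_disk_mb):
    """Return a list of unmet recommendations; an empty list means all are met."""
    problems = []
    if cores < MIN_CORES:
        problems.append(f"cores: {cores} < {MIN_CORES}")
    if ram_gb < MIN_RAM_GB:
        problems.append(f"RAM: {ram_gb} GB < {MIN_RAM_GB} GB")
    if free_disk_mb < MIN_FREE_DISK_MB:
        problems.append(f"free disk: {free_disk_mb} MB < {MIN_FREE_DISK_MB} MB")
    return problems


if __name__ == "__main__":
    cores = os.cpu_count() or 1
    free_mb = shutil.disk_usage(".").free // (1024 * 1024)
    ram_gb = 8  # assumption: entered manually for the host being checked
    for line in crawler_shortfalls(cores, ram_gb, free_mb) or ["all recommended minimums met"]:
        print(line)
```

The check only covers the fixed minimums; the actual need still depends on the document volume described in the first item of the list.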

Hardware requirements, web server

  • 2 cores or more are recommended

  • 4 GB RAM or more

Hardware requirements, database server

  • Data storage disk space depends heavily on the number of structures: approximately 3-4 GB per 1 million structures
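
The rule of thumb above can be turned into a quick estimate. The sketch below assumes that storage scales linearly with the number of structures, which the figure above suggests but does not guarantee; the function name is hypothetical.

```python
def estimate_storage_gb(num_structures, gb_per_million=(3.0, 4.0)):
    """Estimate the (low, high) disk space in GB for a given number of structures.

    Assumes linear scaling from the 3-4 GB per 1 million structures guideline.
    """
    millions = num_structures / 1_000_000
    low, high = gb_per_million
    return (millions * low, millions * high)


if __name__ == "__main__":
    low, high = estimate_storage_gb(2_500_000)
    print(f"2.5 million structures: roughly {low:.1f}-{high:.1f} GB")
```

For example, 2.5 million structures would need roughly 7.5 to 10 GB under this assumption, before any allowance for indexes, logs, or growth.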

Hardware requirements, client

  • Standard office hardware that can run a modern, JavaScript-enabled browser with acceptable performance