- Implement a web crawler
- japanese morphological analysis using janome
- Implement search engine
- Store in MongoDB
- Web frontend using Flask
More details are avairable from My Tech Blog(Japanese).
- Python 3.5
-
Clone repository
$ git clone git@github.com:mejiro/SearchEngine.git
-
Install python packages
$ cd SearchEngine $ pip install -r requirements.txt -c constraints.txt
-
MongoDB settings
-
Run
$ python manage.py crawler # build a index $ python manage.py webpage # access to http://127.0.0.1:5000