Data Flow:
business system -> log -> Flume (collecting, aggregating, etc.) -> Kafka streaming (data cleaning and processing) -> Spark Streaming (stream computation)
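The flow above can be sketched as a chain of plain Python generators. This is only an in-memory stand-in for the real components: the log format `RATING:user|item|score` and all function names are hypothetical and do not come from Flume, Kafka, or Spark.

```python
from collections import defaultdict

# Hypothetical log format (an assumption, not from the notes): "RATING:user|item|score"
PREFIX = "RATING:"


def collect(lines):
    """Stand-in for the Flume stage: forward non-empty log lines."""
    for line in lines:
        line = line.strip()
        if line:
            yield line


def clean(lines):
    """Stand-in for the Kafka cleaning stage: keep rating events and parse them."""
    for line in lines:
        if not line.startswith(PREFIX):
            continue  # not a rating event
        parts = line[len(PREFIX):].split("|")
        if len(parts) != 3:
            continue  # malformed record
        user, item, raw_score = parts
        try:
            score = float(raw_score)
        except ValueError:
            continue  # score is not a number
        yield user, item, score


def average_scores(events):
    """Stand-in for the Spark Streaming stage: average score per item."""
    totals = defaultdict(lambda: [0.0, 0])  # item -> [sum of scores, count]
    for _user, item, score in events:
        totals[item][0] += score
        totals[item][1] += 1
    return {item: total / count for item, (total, count) in totals.items()}


def run_pipeline(lines):
    """Wire the three stages together in the order shown in the data flow."""
    return average_scores(clean(collect(lines)))
```

Each stage only consumes the output of the previous one, which mirrors the way the real components are decoupled by the log and the message queue.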
MongoDB Concepts:
database / collection / document
- show dbs
- use <database name>: switch to (access) that database
- db: print the current database name (e.g. local)
- show collections
- CRUD
- one-to-many structure
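A one-to-many structure can be modelled in two common ways: embed the "many" side inside the parent document, or store the children separately with a reference back to the parent. The sketch below uses plain Python dicts as stand-ins for documents; the field names (`title`, `ratings`, `movie_id`, and so on) are hypothetical examples, not taken from any real collection.

```python
def make_movie(movie_id, title):
    """Parent document with an empty embedded list for the 'many' side."""
    return {"_id": movie_id, "title": title, "ratings": []}


def add_rating(movie, user_id, score):
    """Embedded form: append a sub-document to the parent's list."""
    movie["ratings"].append({"user_id": user_id, "score": score})
    return movie


def split_references(movie):
    """Referenced form: children become separate documents pointing at the parent."""
    parent = {key: value for key, value in movie.items() if key != "ratings"}
    children = [dict(rating, movie_id=movie["_id"]) for rating in movie["ratings"]]
    return parent, children
```

Embedding keeps everything readable in one query, while references suit child lists that grow without bound. With the pymongo driver, dicts like these could be stored with `collection.insert_one(...)`, assuming a running MongoDB instance.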
Elasticsearch vs. Solr
- open http://localhost:9200
- list indices: http://localhost:9200/_cat/indices
- search an index: http://localhost:9200/recommender/_search?pretty
- conditional query: curl -H 'Content-Type: application/json' "localhost:9200/recommender/_search?pretty" -d '{"query": {"bool": {"must": {"exists": {"field": "tags"}}}}}'
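The same conditional query can be built in Python with only the standard library. This is a minimal sketch: the helper names `exists_query` and `build_search_request` are made up for illustration, and the request is only constructed here, not sent, so no Elasticsearch server is needed to run it.

```python
import json
import urllib.request


def exists_query(field):
    """Body for a bool/must/exists query: match documents that have `field`."""
    return {"query": {"bool": {"must": {"exists": {"field": field}}}}}


def build_search_request(index, body, host="http://localhost:9200"):
    """Build (but do not send) a POST request against <host>/<index>/_search."""
    url = f"{host}/{index}/_search?pretty"
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_search_request("recommender", exists_query("tags"))
```

With a server listening on localhost:9200, `urllib.request.urlopen(request).read()` would return the JSON response.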
SQL
- count
- group by
- order
- distinct
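The four operations above can be tried against an in-memory SQLite database. This is a small sketch; the `rating` table and its rows are made-up sample data, and other SQL engines may differ in details.

```python
import sqlite3


def demo_queries():
    """Run COUNT, GROUP BY, ORDER BY and DISTINCT on a tiny sample table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE rating (user_id INTEGER, movie TEXT, score REAL)")
    conn.executemany(
        "INSERT INTO rating VALUES (?, ?, ?)",
        [(1, "A", 4.0), (2, "A", 5.0), (1, "B", 3.0),
         (3, "C", 2.0), (2, "C", 4.0), (3, "A", 3.0)],
    )

    # count: total number of rows
    total = conn.execute("SELECT COUNT(*) FROM rating").fetchone()[0]

    # group by + order: ratings per movie, most-rated first
    per_movie = conn.execute(
        "SELECT movie, COUNT(*) AS n FROM rating "
        "GROUP BY movie ORDER BY n DESC, movie ASC"
    ).fetchall()

    # distinct: each user id once
    users = [row[0] for row in conn.execute(
        "SELECT DISTINCT user_id FROM rating ORDER BY user_id"
    )]

    conn.close()
    return total, per_movie, users
```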
Zookeeper
- jps: list all running Java processes
- start: ./bin/zkServer.sh start
Redis
- run: src/redis-server