AI Recommendation System data crawler basedline
Showing
.gitignore
0 → 100644
.gitlab-ci.yml
0 → 100644
File added
File added
File added
File added
File added
File added
File added
api/category.py
0 → 100644
api/censor.py
0 → 100644
api/dev.py
0 → 100644
This diff is collapsed.
api/domain.py
0 → 100644
api/marketing.py
0 → 100644
This diff is collapsed.
api/services.py
0 → 100644
api/user.py
0 → 100644
File added
File added
File added
classification/categories.py
0 → 100644
File added
File added
classification/preprocess.py
0 → 100644
crawler-api.service
0 → 100644
crawler-processors.service
0 → 100644
crawler.service
0 → 100644
File added
crawler/items/news_header.py
0 → 100644
crawler/news_crawler.py
0 → 100644
File added
crawler/pipelines/models.py
0 → 100644
File added
init_data/add_topics.py
0 → 100644
This diff is collapsed.
This diff is collapsed.
init_data/data/start_url.csv
0 → 100644
init_data/init_database.py
0 → 100644
File added
File added
File added
kafka_module/consume.py
0 → 100644
kafka_module/kafka_admin.py
0 → 100644
log/crawler.log
0 → 100644
This source diff could not be displayed because it is too large.
You can
view the blob
instead.
This diff is collapsed.
This diff is collapsed.
mysql/engine.py
0 → 100644
This diff is collapsed.
mysql/entities.py
0 → 100644
This diff is collapsed.
requirements.txt
0 → 100644
This diff is collapsed.
run.py
0 → 100644
This diff is collapsed.
run_api.py
0 → 100644
This diff is collapsed.
run_processors.py
0 → 100644
This diff is collapsed.
start-crawler-api.sh
0 → 100644
This diff is collapsed.
start-crawler-processors.sh
0 → 100644
This diff is collapsed.
start-crawler.sh
0 → 100644
This diff is collapsed.
stop-crawler-api.sh
0 → 100644
This diff is collapsed.
stop-crawler-processors.sh
0 → 100644
This diff is collapsed.
stop-crawler.sh
0 → 100644
This diff is collapsed.
test/test_api.py
0 → 100644
This diff is collapsed.
test/test_categories.csv
0 → 100644
This diff is collapsed.
test/test_classify.py
0 → 100644
This diff is collapsed.
test/test_sqlalchemy.py
0 → 100644
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
utils/date_utils.py
0 → 100644
This diff is collapsed.
utils/entity_utils.py
0 → 100644
This diff is collapsed.
utils/html_process.py
0 → 100644
This diff is collapsed.
utils/news_processing.py
0 → 100644
This diff is collapsed.
utils/plot.py
0 → 100644
This diff is collapsed.
utils/read_data.py
0 → 100644
This diff is collapsed.
utils/remove_duplicate.py
0 → 100644
This diff is collapsed.
Please
register
or
sign in
to comment