# Pastebin 5M1ESeGG

[2020-07-16 16:25:49,752] INFO in request_consumer: Received a request!
[2020-07-16 16:25:49,780] DEBUG in utils: Path not found: /data/listenbrainz/2020/7.parquet
'Path does not exist: hdfs://hadoop-master.spark-network:9000/data/listenbrainz/2020/7.parquet;'
Fetching file for next date...
[2020-07-16 16:25:49,780] ERROR in utils: Listening history missing form HDFS
[2020-07-16 16:26:24,235] INFO in create_dataframes: Preparing users data and saving to HDFS...
[2020-07-16 16:26:48,708] INFO in create_dataframes: Preparing recordings data and saving to HDFS...
[2020-07-16 16:28:01,626] INFO in create_dataframes: Preparing listen data dump and playcounts, saving playcounts to HDFS...
[2020-07-16 16:30:03,177] DEBUG in request_consumer: Pushing result to RabbitMQ...
[2020-07-16 16:30:03,177] INFO in request_consumer: Done!
[2020-07-16 16:30:03,177] INFO in request_consumer: Number of messages sent: 1
[2020-07-16 16:30:03,178] INFO in request_consumer: Average size of message: 160 bytes
[2020-07-16 16:30:03,178] INFO in request_consumer: Request done!
[2020-07-16 16:30:03,178] INFO in request_consumer: Received a request!
[2020-07-16 16:30:03,421] INFO in train_models: Splitting dataframe...
[2020-07-16 16:30:12,054] INFO in train_models: Training models...
[2020-07-16 16:31:20,455] INFO in train_models: Saving model...
[2020-07-16 16:31:22,351] INFO in train_models: Model saved!
[2020-07-16 16:31:22,375] INFO in train_models: Saving model metadata...
[2020-07-16 16:31:22,658] INFO in train_models: Model metadata saved...
[2020-07-16 16:31:22,678] ERROR in request_consumer: Error in the query handler for query 'cf_recording.recommendations.train_model': [Errno 2] No such file or directory: '/rec/listenbrainz_spark/recommendations/html_files/Model-8f1b795e-5fdd-47d7-9bc5-891f15e31435-2020-07-16.html'
Traceback (most recent call last):
  File "/rec/listenbrainz_spark/request_consumer/request_consumer.py", line 57, in get_result
    return query_handler(**params)
  File "/rec/listenbrainz_spark/recommendations/train_models.py", line 385, in main
    models_training_time)
  File "/rec/listenbrainz_spark/recommendations/train_models.py", line 293, in save_training_html
    save_html(model_html, context, 'model.html')
  File "/rec/listenbrainz_spark/recommendations/utils.py", line 10, in save_html
    with open(outputfile, 'w') as f:
FileNotFoundError: [Errno 2] No such file or directory: '/rec/listenbrainz_spark/recommendations/html_files/Model-8f1b795e-5fdd-47d7-9bc5-891f15e31435-2020-07-16.html'
[2020-07-16 16:31:22,704] INFO in request_consumer: Request done!
[2020-07-16 16:31:22,705] INFO in request_consumer: Received a request!
[2020-07-16 16:31:23,582] INFO in candidate_sets: Fetching listens to get top artists...
[2020-07-16 16:31:23,594] INFO in candidate_sets: Fetching top artists...
[2020-07-16 16:31:23,667] INFO in candidate_sets: Preparing top artists candidate set...
[2020-07-16 16:31:23,702] INFO in candidate_sets: Fetching similar artists...
[2020-07-16 16:31:23,764] INFO in candidate_sets: Preparing similar artists candidate set...
[2020-07-16 16:31:23,793] INFO in candidate_sets: Saving candidate sets...
[2020-07-16 16:31:28,699] ERROR in candidate_sets: Cannot save empty similar artist candidate set
Traceback (most recent call last):
  File "/rec/listenbrainz_spark/recommendations/candidate_sets.py", line 297, in save_candidate_sets
    similar_artist_candidate_set_df.take(1)[0]
IndexError: list index out of range
param@newleader:~$