spiritfly shared this problem 4 months ago

Getting A Lot of Empty Files When Generating Articles From Keyword File

I'm using a list of keywords from a text file to generate articles, and only the Google search option is checked. My keyword list contains completely random keywords that usually have searches.

Once in a while I've noticed that SCM generates articles with an empty title and body text, such as:

TITLE:

{}

BODY:

<div style='text-align:center'><iframe width='600' height='420' src='http://www.youtube.com/embed/CesqLZBjzRg' frameborder='0' allowfullscreen></iframe></div>\n\n<div style='text-align:center'><iframe width='600' height='420' src='http://www.youtube.com/embed/CesqLZBjzRg' frameborder='0' allowfullscreen></iframe></div>

It seems to be finding a video, but no text for the title and body. I get this once in a while (around 1 in 10-15 keywords), and it really bothers me because I feed these articles to GSA SER from a folder. I never noticed which specific keyword caused such articles, but from my understanding SCM should skip empty articles and produce nothing at all if no text is found.

Official Answer
SCM (Employee), posted 4 months ago

It seems the problem is that the article is not empty; it is only missing its paragraph and title content.

Because the article is technically not 'blank', SCM will write it to disk.

The same can happen if you disable all scraping and use only your own custom content, or if all you want is to scrape image/video URLs.
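To illustrate the distinction, here is a minimal Python sketch. It is not SCM's actual code; the `visible_text` helper and the regex-based tag stripping are assumptions made for illustration only. It shows why a plain emptiness check passes for a video-only body, while a check for readable text does not.

```python
import re

# Crude tag matcher; good enough for a sketch, not a full HTML parser.
TAG_RE = re.compile(r"<[^>]*>")


def visible_text(html: str) -> str:
    """Hypothetical helper: return what is left after stripping tags."""
    without_tags = TAG_RE.sub("", html)
    # The sample body in the post shows literal "\n" sequences between
    # the embeds, so treat those as whitespace too.
    return without_tags.replace("\\n", "").strip()


# Body taken from the example in the original post (one embed shown).
video_only_body = (
    "<div style='text-align:center'><iframe width='600' height='420' "
    "src='http://www.youtube.com/embed/CesqLZBjzRg' frameborder='0' "
    "allowfullscreen></iframe></div>"
)

# A naive check sees markup and concludes the article is not blank.
is_blank = video_only_body.strip() == ""

# A readable-text check sees that nothing remains once tags are removed.
has_text = visible_text(video_only_body) != ""
```

Under these assumptions, `is_blank` and `has_text` are both false for the sample body: the article is not blank, yet it contains no readable text.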

At the very least, I will write to the log which keyword came back with no content.

Comments (1)

No, scraping is enabled for Google only, and the keywords come from a file. It would be more useful if SCM did not write to disk when only images/videos were scraped and no title/body content was found for that particular keyword, IF a content source is checked.

If someone wants to scrape only videos/images, they would uncheck all content sources. In that case it should write all the images/videos it finds.
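The rule proposed in this comment could be sketched roughly as follows in Python. This is only an illustration of the request, not how SCM is implemented; the function name, its parameters, and the choice to treat a title of `{}` as empty are all assumptions.

```python
def _is_empty(text: str) -> bool:
    """Assumption: whitespace or a bare '{}' placeholder counts as empty."""
    return text.strip() in ("", "{}")


def should_write_article(title, body_text, media, content_sources_checked):
    """Hypothetical decision rule for writing an article to disk.

    title, body_text: plain text scraped for the keyword (may be empty).
    media: list of scraped image/video URLs.
    content_sources_checked: True if any text content source is enabled.
    """
    has_text = not _is_empty(title) or not _is_empty(body_text)
    has_media = len(media) > 0

    if content_sources_checked:
        # Text was requested, so media alone is not enough to write a file.
        return has_text

    # No content source checked: the user wants media and/or custom
    # content only, so write whatever was found.
    return has_media or has_text
```

With this rule, the video-only article from the original post would be skipped while the Google source is checked, but it would still be written if all content sources were unchecked.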
