When we started to query the Blinkx database on the term, sustainability, we were not sure how many we would have for review. We hoped for 2,500 to work with by the time the prototype would be ready (May) and expected 25% of that number to be of relevancy and quality. We did set an objective to include at least 250 programs in the prototype.
Over 300 (31%) are now in the database and we have more to review than expected.
Here is how we got to the 300.
We are now working with a pool of 3,200 programs and have reviewed for the prototype 1,052 (32%). Of that number, nearly 70% were rejected primarily because of duplication, no connection, or they were not appropriate (sustainability used in a different context): a full one third because they were duplicates. One reason for duplication is because Blinkx is aggregating from over 250 channels and many videos are posted to multiple channels, not always with the same title and description.