data gathering
$30-100 USD
Πληρώθηκε κατά την παράδοση
create a web application that does data mining on twitter data
## Deliverables
Need a server side application that will collect data in the following manner:
1. Search for the string? X? in twitter
it will take the rest of the sentence after the string X till the period? punctuation? mark (.) and save it to a MySQL database into a table called APPSUGGESTION in a field called SUGGESTION
2. Go over each of the rows in the table and check how many times they appear on? Google? and save it to the? [url removed, login to view] field
3. Go over each of the rows in the table and check how many times they appear on twitter and save it to the? [url removed, login to view] field
the? APPSUGGESTION ? table should have these columns:
CREATIONTIME (datetime when this row was created)
SUGGESTIONDATE (datetime when this suggestion was made on tweeter)?
SUGGESTION (string with up to 1000 characters)
SOURCETYPE (string with up to 40 characters? should say TWITTER by default , we might add more sources in the future)?
SOURCE (string with up to 40 characters - twitter user that tweeted this suggestion)
GOOGLEOCCURRENCECOUNT (long - number of times the same suggestion was found on google)
TWITTEROCCURRENCECOUNT ((long - number of times the same suggestion was found on twitter)
this application will run every Y minutes automatically
there should be one web page to configure both X and Y:
X = how frequently the application will run (in minutes)
Y = which search sentence? to use
* the web page label for X = "Analyze sentences? beginning? with :" + X
* the web page label for Y = "Run every " + Y + " minutes"
for example if I setup the following?
* Analyze sentences? beginning? with :? "looking for an app that"
* Run every 30 minutes
one of the rows in the DB might look like this:
CREATIONTIME =? "5/1/2008 8:30:52 AM"
SUGGESTIONDATE =? "4/1/2008 8:30:52 AM"
SUGGESTION = "finds how many people are in a picture"
SOURCETYPE = "TWITTER"
SOURCE = "agulander"
GOOGLEOCCURRENCECOUNT = 23
TWITTEROCCURRENCECOUNT = 7
Ταυτότητα Εργασίας: #3646514