I am a Data Scientist with extensive experience in Python and Java development.
Dealing with 10-100GB certainly sounds like an interesting problem, ripe for parallelization!
I have experience with MapReduce, Hadoop, (Hive, HDFS), MPI, & GPU Programming (CUDA). After discussing the format of the data, and clarifying the project goals with you, I will be able to determine what will be the best approach to solve this problem. Furthermore, I will adjust my bid and time estimate to provide you with more accurate expectations.
When you get a chance, I would like to discuss the project in more detail with you. Also, please let me know if you have any questions about my experience/capabilities.
Regards,
William