A Data Science Central Community
I have a 5-20K USD budget to get a tool (w/wo dedicated hardware), including training for 2 people, to prepare (clean, merge, etc...) and summarize (samples, grouping/aggregations, etc) large data sets (over 5GBs, over 4 million rows).
I need an efficient solution (fast data processing and fast to learn), and definitely easy to use. I would have done it with excel if excel could handle such large data files.
After preparing and summarizing the data, I'll use excel pivot tables for the analysis. I have no programming experience. The datasets are usually in csv or txt formats.