A Data Science Central Community
As you would know, SQL is essential for many tasks involving working with large quantities of data.
Members of Analytic Bridge might like to download my guide to SQL titled 'SQL Essentials' available for free download from my personal website www.markmcilroy.com
Also on my personal website is a practice SQL environment where you can query a sample database and try out the examples from the book.
This book and my…Continue
Added by Mark McIlroy on November 4, 2016 at 6:00pm — No Comments
Here's a new tool that can help to bridge the business-IT divide -- Executable English.
It's a platform on the Web that can support people socially writing their own apps, by typing Executable English knowledge into browsers.
For example, here's the "source code" of an app written in executable English:
Anyone on the…
Added by Adrian Walker on November 6, 2014 at 1:48pm — No Comments
Program development for data process often involves cross-database relational operations. The following example will illustrate Java’s method of handling these operations. sales table is in db2 database, employee table is in mysql database. The task is to join sales with employee through sellerid of sales table and eid of employee table, and filter out the data insales and employeethat…Continue
Added by Jessica May on September 9, 2014 at 1:03am — No Comments
The computing power of SQL for mass structured data is complete, that is to say, it is impossible to find anything that SQL cannot compute. But its support layer is too low, which can lead to over-elaborate operation in practical application.…Continue
Added by Jessica May on July 27, 2014 at 9:30pm — No Comments
OLAP is an important constituent part of BI(business intelligence).
Understood literally, OLAP is online analytical processing, that is, users conduct analytical operation on real-time business data.
But, currently the concept of OLAP is seriously narrowed, and only it refers to operations such as conducting drilling,…Continue
Added by Jessica May on May 27, 2014 at 8:00pm — No Comments
I have a 5-20K USD budget to get a tool (w/wo dedicated hardware), including training for 2 people, to prepare (clean, merge, etc...) and summarize (samples, grouping/aggregations, etc) large data sets (over 5GBs, over 4 million rows).
I need an efficient solution (fast data processing and fast to learn), and definitely easy to use. I would have done it with excel if excel could handle such large data files.
After preparing and summarizing the data, I'll use excel pivot tables…Continue
I attendedAngelHack Sydney recently during the month of May.
AngelHack is a hackathon where developers and entrepreneurs come together to prototype a viable business idea within 24 hours.
The project that I worked on was called "DropQuery". The basic concept is this.
* You have some data files - CSV, XLS, XML
* You want to quickly query it.
I talked to a few people at the…Continue
Added by Eric Bae on June 6, 2013 at 6:28pm — No Comments
Added by Michael Walker on October 24, 2012 at 5:43pm — No Comments
Java is the most widely used programming language with an outstanding architecture. It is the top preferred language to develop the enterprise application. However, Java is not fit for the mass data computation. If encountering the computation that is too complex to be represented in a single SQL statement or it is not allowed to add stored procedures to database, then drawbacks of Java would be highlighted.
Hereby is a case on how to deal with above questions with…Continue
Added by Daisy Ding on July 24, 2012 at 1:23am — No Comments