A Data Science Central Community
Java doesn’t support set operations directly, so nested loops have to be used to realize the operations of intersection, union, complement and etc. between text files. If there are many text files, or the file to be computed is too big to be loaded into the memory, or it is required to perform set operations according to multiple fields, the code will become even more complicated. However, with the assistance of esProc, which supports set operations…Continue
Added by Lynn Guo on November 13, 2014 at 6:00pm — No Comments
Java doesn’t support set operations directly, so nested loops have to be used to realize the operations of intersection, union, complement and etc. between text files. If there are many text files, or the file to be computed is too big to be loaded into the memory, or it is required to perform set operations according to multiple fields, the code will become even more complicated. However, with the assistance of esProc, which supports set operations directly, Java can realize these…Continue
Added by Lynn Guo on November 11, 2014 at 12:00am — No Comments
Here's a new tool that can help to bridge the business-IT divide -- Executable English.
It's a platform on the Web that can support people socially writing their own apps, by typing Executable English knowledge into browsers.
For example, here's the "source code" of an app written in executable English:
Anyone on the…
Added by Adrian Walker on November 6, 2014 at 1:48pm — No Comments
Generally, Java programmers use poi or other open source packages to read and compute Excel data. These open source packages support low-level programming, which increases the overall learning cost and complicates the operation. But with the help of esProc, Java can avoid these problems.…Continue
Added by Jessica May on October 8, 2014 at 12:26am — No Comments
The join statements of the database can be used conveniently to perform the operation of alignment join. But sometimes the data is stored in the text files, and to compute it in Java alone we need to write a large number of loop statements. This makes the code cumbersome. Using esProc to help with programming in Java can solve the problem easily and quickly. Let’s look at how this works…Continue
Added by Jessica May on September 28, 2014 at 8:00am — No Comments
It is common to use R language to group and summarize data of files. Sometimes we may find ourselves processing comparatively big files which have smaller computed result and bigger source data. We cannot load them wholly to the memory when we need to compute them. The only solutions could be batch importing and computing as well as result merging. We’ll use an example in the following to illustrate the way of R language to group and summarize data from big text files.
Here is a file,…Continue
We all know the constant struggle between IT and business users when it comes to BI software: Business users want to access data in order to make fast decisions independently, without having to use IT as a middle man every time a new requirement arises, query is run, or data is added. Yet, IT is overwhelmed by constantly changing requests and requirements, and struggle to deliver data in an actionable time-frame. The…Continue
Added by Elana Roth on August 19, 2014 at 2:30am — No Comments
If I told you all of our modern activities–posts to social media sites, transaction records, online activities, digital pictures and videos, and mobile GPS signals to name a few– leave a trail of data behind that is estimated to produce about 2.5 quintillion bytes of data each day, would you have a real feel for how much data that is? Even if I defined that number more clearly as 1 followed by 18 zeros, many tech veterans and savvy business leaders still have trouble digesting the scope of…Continue
Added by Elana Roth on July 31, 2014 at 4:30am — No Comments
The computing power of SQL for mass structured data is complete, that is to say, it is impossible to find anything that SQL cannot compute. But its support layer is too low, which can lead to over-elaborate operation in practical application.…Continue
Added by Jessica May on July 27, 2014 at 9:30pm — No Comments
If you have your eyes set on running a company someday, you may wonder what skills you need to cultivate now, as well as the best age to make a run for the position. Well, we took data from the LinkedIn profiles of close to 1 million CEOs in the United States, and gained some great data intelligence using a business analytics software. Here’s your chance to learn what it takes to get that corner office as well as see how crunching big data of the head honchos can offer great…Continue
Added by Elana Roth on July 24, 2014 at 2:30am — No Comments
Both esProc and R language are typical data processing and analysis languages with two-dimension…Continue
Added by Jessica May on July 21, 2014 at 1:36am — No Comments
Text files often brings headache for data analysts, are there any more convenient way for text files process? I prepare a case on how esProc deal with it, including import various text files; process big text files; visit text files of hdfs; as well as general operations, such as, file moving, deleting and checking whether a file exists. The following will illustrate these functions through examples. …Continue
Added by Jessica May on July 14, 2014 at 1:30am — No Comments
While having a many-to-many relationship may sound steamy to some of you, when it comes to the data in your tables it can get complicated. That is, unless you understand the different scenarios, their resolutions, and how to build a good relationship with your data. Just as a general overview, a relationship specifies the logic used to combine data from one or more tables. You create relationships by connecting fields between two…Continue
Added by Elana Roth on July 10, 2014 at 11:00am — No Comments
Added by Phuoc Do on July 1, 2014 at 5:30pm — No Comments
OLAP is an important constituent part of BI(business intelligence).
Understood literally, OLAP is online analytical processing, that is, users conduct analytical operation on real-time business data.
But, currently the concept of OLAP is seriously narrowed, and only it refers to operations such as conducting drilling,…Continue
Added by Jessica May on May 27, 2014 at 8:00pm — No Comments
Sharing a lecture introducing business analytics to agroup of MBA and MIS students at Erasmus Rotterdam School of Management:
Part 1 of 4: https://www.youtube.com/watch?v=KFYYDrEV4wI
Part 2 of 4: …Continue
Added by Scott Mongeau on May 19, 2014 at 9:37am — No Comments
Data is everywhere (that's indisputable). What I didn't realize was how often--and how subtly--personal big trails get created every day.
Case in point, I went to Kohl's to use up a free $10 rewards coupon card last weekend. While I was checking out, the cashier asked me for my phone number. I paused. I didn't understand why he needed…Continue
Added by Mar on May 5, 2014 at 8:49pm — No Comments
Maintenance, Repair and Overhaul (MRO) operations are of extreme importance for some industries. They are particularly crucial for airplanes, helicopters, trains and heavy production machinery (such as power plant equipment). In fact, some equipment has a lifespan of 20 to 25 years, and over the course of its usage period, for every euro spent to purchase the equipment, MRO costs will be 3 to 3.5 euros.
In this market, the distribution of MRO activities varies widely between…Continue
Added by Michel Bruley on March 10, 2014 at 2:35am — No Comments
In my previous blog post I have explained the steps needed to solve a data analysis problem. Going further, I will be discussing in-detail each and every step of Data Analysis. In this post, we shall discuss about exploratory Analysis.
What is Exploratory Analysis?
All of us at some point in the process of examining…Continue
Added by aatash shah on February 27, 2014 at 5:36am — No Comments