Subscribe to DSC Newsletter

All Blog Posts Tagged 'data' (134)

esProc Helps Process Structured Texts in Java – Set Operations

Java doesn’t support set operations directly, so nested loops have to be used to realize the operations of intersection, union, complement and etc. between text files. If there are many text files, or the file to be computed is too big to be loaded into the memory, or it is required to perform set operations according to multiple fields, the code will become even more complicated. However, with the assistance of esProc, which supports set operations directly, Java can realize these…

Continue

Added by Lynn Guo on November 11, 2014 at 12:00am — No Comments

Executable English for Self-Service BI

Here's a new tool that can help to bridge the business-IT divide -- Executable English.



It's a platform on the Web that can support people socially writing their own apps, by typing  Executable English knowledge into browsers.



For example, here's the "source code" of an app written in executable English:



    www.reengineeringllc.com/demo_agents/GrowthAndDebt1.agent



Anyone on the…

Continue

Added by Adrian Walker on November 6, 2014 at 1:48pm — No Comments

A use case to read and analyze Excel data in Java

Generally, Java programmers use poi or other open source packages to read and compute Excel data. These open source packages support low-level programming, which increases the overall learning cost and complicates the operation. But with the help of esProc, Java can avoid these problems.…

Continue

Added by Jessica May on October 8, 2014 at 12:26am — No Comments

Data alignment join in Java for easier text analytics

The join statements of the database can be used conveniently to perform the operation of alignment join. But sometimes the data is stored in the text files, and to compute it in Java alone we need to write a large number of loop statements. This makes the code cumbersome. Using esProc to help with programming in Java can solve the problem easily and quickly. Let’s look at how this works…

Continue

Added by Jessica May on September 28, 2014 at 8:00am — No Comments

A Method of Grouping and Summarizing Data of Big Text Files in R Language

It is common to use R language to group and summarize data of files. Sometimes we may find ourselves processing comparatively big files which have smaller computed result and bigger source data. We cannot load them wholly to the memory when we need to compute them. The only solutions could be batch importing and computing as well as result merging. We’ll use an example in the following to illustrate the way of R language to group and summarize data from big text files.

Here is a file,…

Continue

Added by Jessica May on August 24, 2014 at 8:54pm — 2 Comments

9 Questions to Determine If a BI Solution Is Truly Self-Service

A Tool That Grants Independence to Business Users

We all know the constant struggle between IT and business users when it comes to BI software: Business users want to access data in order to make fast decisions independently, without having to use IT as a middle man every time a new requirement arises, query is run, or data is added. Yet, IT is overwhelmed by constantly changing requests and requirements, and struggle to deliver data in an actionable time-frame. The…

Continue

Added by Elana Roth on August 19, 2014 at 2:30am — No Comments

How Much Data Will You Have in 3 Years?

If I told you all of our modern activities–posts to social media sites, transaction records, online activities, digital pictures and videos, and mobile GPS signals to name a few– leave a trail of data behind that is estimated to produce about 2.5 quintillion bytes of data each day, would you have a real feel for how much data that is? Even if I defined that number more clearly as 1 followed by 18 zeros, many tech veterans and savvy business leaders still have trouble digesting the scope of…

Continue

Added by Elana Roth on July 31, 2014 at 4:30am — No Comments

Some Cases illustrating drawbacks of SQL in data computing and analytics

The computing power of SQL for mass structured data is complete, that is to say, it is impossible to find anything that SQL cannot compute. But its support layer is too low, which can lead to over-elaborate operation in practical application.…

Continue

Added by Jessica May on July 27, 2014 at 9:30pm — No Comments

Analyzing the Top LinkedIn Skills of CEOs

If you have your eyes set on running a company someday, you may wonder what skills you need to cultivate now, as well as the best age to make a run for the position. Well, we took data from the LinkedIn profiles of close to 1 million CEOs in the United States, and gained some great data intelligence using a business analytics software. Here’s your chance to learn what it takes to get that corner office as well as see how crunching big data of the head honchos can offer great…

Continue

Added by Elana Roth on July 24, 2014 at 2:30am — No Comments

The difference Between esProc’s Sequence Table Object and R’s Data Frame on data processing

Both esProc and R language are typical data processing and analysis languages with two-dimension…

Continue

Added by Jessica May on July 21, 2014 at 1:36am — No Comments

How to Process Text Files in the Data Analytics

Text files often brings headache for data analysts, are there any more convenient way for text files process? I prepare a case on how esProc deal with it, including import various text files; process big text files; visit text files of hdfs; as well as general operations, such as, file moving, deleting and checking whether a file exists. The following will illustrate these functions through examples. …

Continue

Added by Jessica May on July 14, 2014 at 1:30am — No Comments

Many-to-Many Relationships & How to Be in a Good Relationship

The Many Faces of Data Relationships

While having a many-to-many relationship may sound steamy to some of you, when it comes to the data in your tables it can get complicated. That is, unless you understand the different scenarios, their resolutions, and how to build a good relationship with your data. Just as a general overview, a relationship specifies the logic used to combine data from one or more tables. You create relationships by connecting fields between two…

Continue

Added by Elana Roth on July 10, 2014 at 11:00am — No Comments

Coding Web Visualization Tutorials

Hello,

I'm starting a new series of tutorials for beginners to learn web visualization. My tutorials help bridge the gap to d3.js visualization. It's intended to teach audience the concepts of Javascript applicable for data visualizations.

Click here for…

Continue

Added by Phuoc Do on July 1, 2014 at 5:30pm — No Comments

What difficulties SQL have in OLAP

1. What OLAP We Need Indeed?

Category: Model

OLAP is an important constituent part of BI(business intelligence).

Understood literally, OLAP is online analytical processing, that is, users conduct analytical operation on real-time business data.

But, currently the concept of OLAP is seriously narrowed, and only it refers to operations such as conducting drilling,…

Continue

Added by Jessica May on May 27, 2014 at 8:00pm — No Comments

Introduction to business analytics online lecture

Sharing a lecture introducing business analytics to agroup of MBA and MIS students at Erasmus Rotterdam School of Management:

Summary:  https://www.youtube.com/watch?v=KmG7FJhms2Q

Part 1 of 4: https://www.youtube.com/watch?v=KFYYDrEV4wI

Part 2 of 4: …

Continue

Added by Scott Mongeau on May 19, 2014 at 9:37am — No Comments

Personal Big Data Trails

Data is everywhere (that's indisputable). What I didn't realize was how often--and how subtly--personal big trails get created every day. 

Case in point, I went to Kohl's to use up a free $10 rewards coupon card last weekend. While I was checking out, the cashier asked me for my phone number. I paused. I didn't understand why he needed…

Continue

Added by Mar on May 5, 2014 at 8:49pm — No Comments

Importance of detailed data in maintenance operation

Maintenance, Repair and Overhaul (MRO) operations are of extreme importance for some industries. They are particularly crucial for airplanes, helicopters, trains and heavy production machinery (such as power plant equipment). In fact, some equipment has a lifespan of 20 to 25 years, and over the course of its usage period, for every euro spent to purchase the equipment, MRO costs will be 3 to 3.5 euros.

 

In this market, the distribution of MRO activities varies widely between…

Continue

Added by Michel Bruley on March 10, 2014 at 2:35am — No Comments

Few Exploratory Analysis techniques explained

In my previous blog post I have explained the steps needed to solve a data analysis problem. Going further, I will be discussing in-detail each and every step of Data Analysis. In this post, we shall discuss about exploratory Analysis.

What is Exploratory Analysis?

“Understanding data…

Continue

Added by suresh kumar Gorakala on March 6, 2014 at 11:09pm — 2 Comments

Read this tutorial before you use Proc Corr

All of us at some point in the process of examining…

Continue

Added by aatash shah on February 27, 2014 at 5:36am — No Comments

Analytics Trends to look out for in 2014 (India specific)

What do we predict for the Indian Analytics Industry this year? Here is what we put together at IVY after talking with our analytics recruiters, alumni, clients, and industry associates:…

Continue

Added by Ivy Pro School on February 1, 2014 at 10:53pm — No Comments

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

On Data Science Central

© 2019   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service