A Data Science Central Community

*I was recently reminded of some work I had completed for the oil and gas industry many years ago and thought it would be useful to share with other analysts/statisticians.*

*For more general information see a case study I have put onto my personal blog (Offshore Storms Statistics and Oil Rigs).*

It was important for the oil and gas engineers to gain a better understanding of the offshore environment during storm conditions, and in particular if there would be an impact upon the mooring of their semi-submersible drilling rigs. (See for example this paper: Bowers J, Morton I, Mould G 1997. Weathering the storm – how OR steered a course between extreme statistics & offshore design. OR Insight; 10 3: 16-21).

It was necessary for me to use circular statistics to analyse the wind and wave directions and come up with the mean directions and a 95% confidence interval.

Linear statistics aren’t appropriate because there is a crossover problem. If we consider three wind directions of 358, 0 and 2 degrees respectively then using linear statistics to find the mean we would add them together and divide by three, to arrive at a mean wind direction of 120 degrees (but it should have given an answer of 0 degrees).

I started to read about how to calculate basic circular statistics from a very useful book by NI Fisher *Statistical analysis of circular data*, and soon realised that I would have to revise my school maths on how to work with trigonometric functions. The mean direction is found from equation 1. It turns out that the calculation of a 95% confidence interval for the sample mean of circular data has mathematical equations that are intractable, and approximations which are unwieldy. Firstly, you could then assume that the data fit the “von Mises” distribution (It’s a wild assumption, other distributions might be more appropriate). Then, given this leap of faith, a reasonable approximation to determine the concentration is as in equation 2 and following on from this, an estimate of the 95% confidence interval is given by equation 3. Maybe I have set the seeds for you to think about using circular statistics in your work, or alternatively you didn’t get this far because you were put off by the equations. Other practical examples are provided by Fisher and two examples are: the arrival times at an intensive care unit; and the vanishing directions of homing pigeons.

Please let me know if you are using circular statistics in your work, or you have any other comments.

**Further reading**

Bowers JA, Morton I, Mould GI 2000. __Directional statistics of the wind and waves__. Applied Ocean Research; 22: 13-30.

© 2019 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC Powered by

Badges | Report an Issue | Privacy Policy | Terms of Service

**Most Popular Content on DSC**

To not miss this type of content in the future, subscribe to our newsletter.

- Book: Statistics -- New Foundations, Toolbox, and Machine Learning Recipes
- Book: Classification and Regression In a Weekend - With Python
- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
- Cheat Sheets | Curated Articles | Search | Jobs | Courses
- Post a Blog | Forum Questions | Books | Salaries | News

**Archives:** 2008-2014 |
2015-2016 |
2017-2019 |
Book 1 |
Book 2 |
More

**Most popular articles**

- Free Book and Resources for DSC Members
- New Perspectives on Statistical Distributions and Deep Learning
- Time series, Growth Modeling and Data Science Wizardy
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- Comprehensive Repository of Data Science and ML Resources
- Advanced Machine Learning with Basic Excel
- Difference between ML, Data Science, AI, Deep Learning, and Statistics
- Selected Business Analytics, Data Science and ML articles
- How to Automatically Determine the Number of Clusters in your Data
- Fascinating New Results in the Theory of Randomness
- Hire a Data Scientist | Search DSC | Find a Job
- Post a Blog | Forum Questions

## You need to be a member of AnalyticBridge to add comments!

Join AnalyticBridge