Open Access

A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language

Author(s) - Nur Syahirah Zulkipli, Siti Zanariah Satari, Wan Nur Syahidah Wan Yusoff

Publication year - 2021

Publication title - Journal of Physics: Conference Series

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.21

H-Index - 85

eISSN - 1742-6596

pISSN - 1742-6588

DOI - 10.1088/1742-6596/1988/1/012111

Subject(s) - outlier, univariate, python (programming language), computer science, synthetic data, data mining, algorithm, artificial intelligence, programming language, multivariate statistics, machine learning

Synthetic data is artificial data created from the statistical properties of original data. The aim of this study is to generate synthetic (simulated) univariate circular data that follow the von Mises (VM) distribution under various outlier scenarios, using the Python programming language. A procedure for formulating synthetic data generation is proposed. The synthetic data are generated from combinations of seven sample sizes, n, and five concentration parameters, K. The data generation procedure is also formulated for different outlier conditions: three outlier scenarios are proposed, each introducing outliers into the synthetic dataset by placing them at a specific distance away from the inliers. The number of outliers planted in each dataset is fixed at three. The synthetic data are randomly generated using the Python libraries and packages ‘numpy’, ‘random’ and ‘von Mises’. In conclusion, synthetic univariate circular data from the von Mises distribution are generated, and outliers are successfully introduced into the dataset under three outlier scenarios using Python. This study will be valuable for those who are interested in studying univariate circular data with outliers and who choose Python as an analysis tool.
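
The general idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not state the sample sizes, the concentration values, or how the three outlier scenarios are defined, so the function name `generate_vm_with_outliers`, the parameters `shift` and `spread`, and the placement rule (outliers clustered at a fixed angular distance from the mean direction) are all assumptions made for illustration. Only `numpy` is used here.

```python
import numpy as np


def generate_vm_with_outliers(n, kappa, mu=0.0, n_outliers=3,
                              shift=np.pi, spread=0.05, seed=None):
    """Return n angles in [0, 2*pi) and a boolean mask marking planted outliers.

    Hypothetical helper: the placement rule below is an assumption,
    not one of the paper's three outlier scenarios.
    """
    rng = np.random.default_rng(seed)

    # Inliers: draws from the von Mises distribution VM(mu, kappa).
    # NumPy returns values in [-pi, pi], so wrap them to [0, 2*pi).
    inliers = rng.vonmises(mu, kappa, size=n - n_outliers) % (2 * np.pi)

    # Outliers (assumed rule): place them `shift` radians away from the
    # mean direction, with a small uniform jitter of +/- `spread` radians.
    jitter = rng.uniform(-spread, spread, size=n_outliers)
    outliers = (mu + shift + jitter) % (2 * np.pi)

    data = np.concatenate([inliers, outliers])
    is_outlier = np.concatenate([
        np.zeros(n - n_outliers, dtype=bool),
        np.ones(n_outliers, dtype=bool),
    ])
    return data, is_outlier


# Example: one dataset of size 50 with concentration 4 and three outliers.
data, is_outlier = generate_vm_with_outliers(n=50, kappa=4.0, seed=1)
```

Looping such a function over a grid of sample sizes and concentration values would give one synthetic dataset per combination. The returned mask records which observations were planted, which is convenient when later evaluating an outlier detection method.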
