STATISTICS

STATISTICS

A paper

Presented to fulfill the requirment of Statistics

CHAPTER I
INTRODUCTION
A.           Background of study
Statistics is a branch of mathematics that deals with the collection, analysis and interpretation of data.  Data can be defined as groups of information that represent the qualitative or quantitative attributes of a variable or set of variables. In layman's terms, data in statistics can be any set of information that describes a given entity. An example of data can be the ages of the students in a given class. When you collect those ages, that becomes your data.
A set in statistics is referred to as a population. Though this term is commonly used to refer to the number of people in a given place, in statistics, a population refers to any entire set from which you collect data.
Today, statistics has become an important tool in the work of many academic disciplines such as medicine, psychology, education, sociology, engineering and physics, just to name a few. Statistics is also important in many aspects of society such as business, industry and government. Because of the increasing use of statistics in so many areas of our lives, it has become very desirable to understand and practise statistical thinking. This is important even if you do not use statistical methods directly.

B.            The Formulation of the problem
In this paper the writer explains about the formulation of the problem, that are:
1.      What are the definition of statistics, educational statistics, and Classification of Statistics?
2.      How the way to use and the functions of statistics in education
3.      What are the definition of statistics data and Kinds of statistics
4.      What are the principles of collecting statistics data

5.      How the way of collecting statistics data

6.      What are the tool of collecting data


C.           The purpose of the writing
1.        To know what are the definition of statistics, educational statistics, and Classification of Statistics.
2.        The way how to use and the functions of statistics in education
3.        To know what are the definition of statistics data and Kinds of statistics.
4.        To know what are the principles of collecting statistics data
5.        The way How to collect  statistics data
6.        To know what are the tool of collecting data
   

CHAPTER I
DISCUSSION

A.           Statistic and educational Statistics
1.             Definition of Statistic
Statistics, in short, is the study of data. It includes descriptive statistics (the study of methods and tools for collecting data, and mathematical models to describe and interpret data) and inferential statistics (the systems and techniques for making probability-based decisions and accurate predictions.
From Entomology Side, the name is implies. Statistics has its roots in the idea of "the state of things". The word itself comes from the ancient Latin term statisticum collegium, meaning "a lecture on the state of affairs". Eventually, this evolved into the Italian word statista, meaning "statesman", and the German word Statistic, meaning "collection of data involving the State". Gradually, the term came to be used to describe the collection of any sort of data.
Statistics is also as a subset of mathematics. As one would expect, statistics is largely grounded in mathematics, and the study of statistics has lent itself to many major concepts in mathematics: probability, distributions, samples and populations, the bell curve, estimation, and data analysis[1].
According the oxford dictionary, Statistics is a collection of information shows in numbers. Then Ir. M. Iqbal Hasan,. MM said, Statistics is the study of the ins and outs of the data, namely regarding the collection, processing, analyzers, interpretation and conclusion of the data in the form of numbers.
Besides that, according Irianto , Statistics basically a tool to provide an overview of an event through a simple form, either in the form of numbers - numbers and graphs - graphs, given its role as a tool, it is necessary to realize that the key to the success of the statistical analysis is still lies in its use[2].
Statistics is the study of the collection, analysis, interpretation, presentation, and organization of data. In applying statistics, e.g. a scientific, industrial, or societal problem, it is necessary to begin with a population or process to be studied. Populations can be diverse topics such as "all persons living in a country" or "every atom composing a crystal". It deals with all aspects of data including the planning of data collection in terms of the design of surveys and experiments[3].
In case census data cannot be collected, statisticians collect data by developing specific experiment designs and survey samples. Representative sampling assures that inferences and conclusions can safely extend from the sample to the population as a whole. An experimental study involves taking measurements of the system under study, manipulating the system, and then taking additional measurements using the same procedure to determine if the manipulation has modified the values of the measurements. In contrast, an observational study does not involve experimental manipulation.
Statistics can be said to have begun in ancient civilization, going back at least to the 5th century BC, but it was not until the 18th century that it started to draw more heavily from calculus and probability theory. Statistics continues to be an area of active research, for example on the problem of how to analyze Big data[4].
Statistics basically a tool to provide an overview of an event through a simple form, either in the form of figures and graphs. Originally statistical terms only a set of numbers that describe the state of the population, people's income, the level of agricultural production at any given time. in a statistical sense here just to give an idea of the past until now in the picture[5].
Statistics is concerned with scientific methods for collecting, organising, summarizing, presenting and analyzing data as well as deriving valid conclusions and making reasonable decisions on the basis of this analysis. Statistics is concerned with the systematic collection of numerical data and its interpretation. The word ‘statistic’ is used to refer to:
1.    Numerical facts, such as the number of people living in particular area.
2.    The study of ways of collecting, analyzing and interpreting the facts[6].

2.             Definition of Educational Statistics.
Statistics education is the practice of teaching and learning of statistics, along with the associated scholarly research.
Statistics is both a formal science and a practical theory of scientific inquiry, and both aspects are considered in statistics education. Education in statistics has similar concerns as does education in other mathematical sciences, like logic, mathematics, and computer science. At the same time, statistics is concerned with evidence-based reasoning, particularly with the analysis of data. Therefore education in statistics has strong similarities to education in empirical disciplines like psychology and chemistry, in which education is closely tied to "hands-on" experimentation.
Mathematicians and statisticians often work in a department of mathematical sciences (particularly at colleges and small universities). Statistics courses have been sometimes taught by non-statisticians, against the recommendations of some professional organizations of statisticians and of mathematicians.
Statistics education research is an emerging field that grew out of different disciplines and is currently establishing itself as a unique field that is devoted to the improvement of teaching and learning statistics at all educational levels[7].
Statistics educators have cognitive and non-cognitive goals for students. For example, former American Statistical Association (ASA) President Katherine Wallman defined statistical literacy as including the cognitive abilities of understanding and critically evaluating statistical results as well as appreciating the contributions statistical thinking can make.
One of the goals of the American Statistical Association is to improve statistics education at all levels. Through the Statistics Education Web (STEW), the ASA is reaching out to K-12 mathematics and science teachers who teach statistics concepts in their classrooms. STEW is an online resource for peer-reviewed lesson plans for K-12 teachers. The web site is maintained by the ASA and accessible to K-12 teachers throughout the world.
Statistics and probability concepts are included in K-12 curriculum standards, in particular the Common Core State Standards, and on state and national exams; however, few K-12 teachers have formal training or applied experience with statistical concepts. K-12 teachers need a place where they can find peer-reviewed teaching materials available in a standard format. Teachers also can benefit from guidance toward activities that are appropriate for their students' maturity levels and from the ability to select relevant, useful, and meaningful applications. The ASA is the logical entity to host a resource to support teachers in their efforts to master the content and incorporate it into their classrooms[8].
IASE, the International Association for Statistical Education, seeks to promote, support and improve statistical education at all levels everywhere around the world. It is the international umbrella organization for statistics education. It fosters international cooperation, and stimulates discussion and research. It disseminates ideas, strategies, research findings, materials and information using publications, international conferences, and increasingly, this website.
IASE is the education section of the International Statistical Institute (ISI), but may also be joined independently by those who wish to participate in IASE’s activities, or simply to support the work on improving statistics education and extending its outreach[9].
In other explanation, Education Statistics is the science that discuss or learn and develop principles - principles, methods and procedures to be used in the context of data collection, presentation, analysis of materials in the form of numeric information about things - things related to education and drawing conclusions, making of estimates in the forecast and scientific[10].
  
3.             Classification of Statistics
1)             Classification Statistics For analysis method:
a)    Descriptive Statistics
Descriptive statistics is the science of statistics relating to the activities of recording and summarizing the results of observations of events or human characteristics, place and so on, quantitative, or statistical study ways of collecting, and preparing, and presenting and describing data has been collected to a study.
The activities included in this category, such as data collection, data grouping, the determination of the value and statistical functions, create graphs, diagrams and drawings. Records of births, deaths and marriages per year are called statistics. Similarly descriptive regarding age, education level, as well as enrich composition of the population living in an area.
The main purpose of the operation of descriptive statistics is easier for people to read and understand the meaning of data. The scope includes descriptive statistics:
Ø The frequency distribution
Ø Measurement of central values (mean, mode, median and standard deviation), dispersion, skewers and kurtosis.
Ø Presentation of data in the form of graphs (histograms, polygons, ogive)
Ø Figures Index
Ø The time series or time series
b)   Inferential Statistics
Inferential statistics is the science that studies the statistical procedures for drawing conclusions about the entire population based on existing data (sample) or statistics relating to the activities of drawing conclusions from the facts, and making decisions based on facts.
In the inferential statistic contain parameter estimation, hypothesis testing, prediction and calculation of the degree of association between variables. The scope of inferential statistics includes:
Ø Probability
Ø Data Distribution
Ø Estimation of parameters
Ø Test the hypothesis included chi-square test and analysis of variance
Ø Regression Analysis
Ø Correlation Analysis

2)             Classification of Statistics According to the How It Works:
In the inferential statistics / Inductive, various statistical tests that can be used basically be divided into two groups, namely the Test Statistic Parametric and Non-Parametric Test Statistics.
a)    Statistical Parametric
Parametric Statistics is a statistical test that is already known in advance that the data scale and ratio scale interval data, the distribution (distribution) that is normally distributed data. When viewed from the amount of data, usually large amounts of data, at least great than or equal to 30 data. The larger the data, it will be close to normal assumptions.
b)   Non-Parametric Statistics
Non-Parametric Statistics is a statistical test of the unknown distribution of data and need not normally distributed population where quantities are not known or assumptions are required in the population (in parametric statistics) are not met. Thus these statistics can be regarded as statistical tests assume free[11].

4.             The Use and The functions of Statistics in Education
A function held by the statistics in the world of education is becoming a tool in the teaching-learning process.
In assessing the activities of the educational outcomes, an educator wearing certain norms; The norm is essentially a kind of size. The assessment results are usually expressed in a variety of ways. But the most common way is to put it in the form of numbers (numbers). It is assessed itself is progress or development of students as they go through the process of education in a given period of time. Actually is qualitative, but converted into quantitative data. In other words, the results of the assessment conducted quantification. Quantification reason it is certainly vary, but the most important reason is to make changes to the material information that is not of a figure into the material particulars of the figures, educators will be able to be more straightforward to obtain an overview of the progress or development which has been achieved by students, after they undergo the process of education. By using quantitative data an education will be able to obtain certainty, rather than using qualitative data. Because the educational outcomes assessment activities most common way is to use the data quantitative.
An important function as a tool, which is a tool to process, analyzes, and summarizes the results that have been achieved in the assessment activities.
For a professional educator, statistics also has a sizable usability. Because by the use of statistics as a tool, it is based on the data that he will be able to exact:
a.    Getting a picture, either specifically or picture a general description of a symptom, condition or event.
b.    Following the development or tidal regarding symptoms, circumstances or events that, from time to time.
c.    Testing, if the symptoms are different from one another symptom or not. If there are differences, whether those differences are significant differences or differences occurred simply by chance alone.
d.   Know if the problem has to do with one another symptom.
e.    Prepare a report in the form of quantitative data with regular, clear and concise.
f.     Logically draw conclusions, make decisions accurately and steadily, and can estimate or predict things that may happen in the future, and what concrete steps that may be performed by an educator[12].

B.            Statistics Data
1.             Definition of statistics data
Data statistic is something that does not have any meaning for the recipient and is still in need of a treatment. Data may manifest a state, pictures, sounds, letters, numbers, math, language or other symbols that can be used as an ingredient to look at the environment, objects, events, or a concept[13]. Then, statistical data is a collection of information or facts that describes a problem. Data is a representation of real-world facts that represent an object as a value that is recorded in the form of numbers, letters, symbols, text, images, sounds or combinations thereof[14].
Data can be defined as groups of information that represent the qualitative or quantitative attributes of a variable or set of variables, which is the same as saying that data can be any set of information that describes a given entity. Data in statistics can be classified into grouped data and ungrouped data.
Any data that you first gather is ungrouped data. Ungrouped data is data in the raw. An example of ungrouped data is a any list of numbers that you can think of.
In the statistics there is also called as Grouped Data. Grouped data is data that has been organized into groups known as classes. Grouped data has been 'classified' and thus some level of data analysis has taken place, which means that the data is no longer raw.
A data class is group of data which is related by some user defined property. For example, if you were collecting the ages of the people you met as you walked down the street, you could group them into classes as those in their teens, twenties, thirties, forties and so on. Each of those groups is called a class.
Each of those classes is of a certain width and this is referred to as the Class Interval or Class Size. This class interval is very important when it comes to drawing Histograms and Frequency diagrams. All the classes may have the same class size or they may have different classes sizes depending on how you group your data. The class interval is always a whole number[15].
Below is an example of grouped data where the classes have the same class interval.
Age (years)
Frequency
0 - 9
12
10 - 19
30
20 - 29
18
30 - 39
12
40 - 49
9
50 - 59
6
0        - 69
Solution:
Below is an example of grouped data where the classes have different class interval.
Age (years)
Frequency
Class Interval
0 - 9
15
10
10 - 19
18
10
20 - 29
17
10
30 - 49
35
20
50    - 79

2.             Kinds of Statistics Data
When a given data set is numerical in nature, it is necessary to carefully distinguish the actual nature of the variable being quantified. Statistical tests are generally specific for the kind of data being handled.
1)             Data on a nominal (or categorical) scale
Identifying the true nature of numerals applied to attributes that are not "measures" is usually straightforward and apparent. Examples in everyday use include road, car, house, and book and telephone numbers. A simple test would be to ask if re-assigning the numbers among the set would alter the nature of the collection. If the plates on a car are changed, for example, it still remains the same car in reality.
2)             Data on an Ordinal Scale
An ordinal scale is a scale with ranks. Those ranks only have sense in that they are ordered, that is what makes it ordinal scale. The distance [rank n] minus [rank n-1] is not guaranteed to be equal to [rank n-1] minus [rank n-2], but [rank n] will be greater than [rank n-1] in the same way [rank n-1] is greater than [rank n-2] for all n where [rank n], [rank n-1], and [rank n-2] exist. Ranks of an ordinal scale may be represented by a system with numbers or names and an agreed order.
We can illustrate this with a common example: the Likert scale. Consider five possible responses to a question, perhaps Our president is a great man, with answers on this scale
Response:
Strongly Disagree
Disagree
Neither Agree nor Disagree
Agree
Strongly Agree
Code:
1
2
3
4
5
Here the answers are a ranked scale reflected in the choice of numeric code. There is however no sense in which the distance between Strongly agree and Agree is the same as between Strongly disagree and Disagree. Numerical ranked data should be distinguished from measurement data.
3)             Measurement data
Numerical measurements exist in two forms, Meristic and continuous, and may present themselves in three kinds of scale: interval, ratio and circular.
a.              Meristic or discrete variables are generally counts and can take on only discrete values. Normally they are represented by natural numbers. The number of plants found in a botanist's quadrant would be an example. (Note that if the edge of the quadrant falls partially over one or more plants, the investigator may choose to include these as halves, but the data will still be meristic as doubling the total will remove any fraction).
b.             Continuous variables are those whose measurement precision is limited only by the investigator and his equipment. The length of a leaf measured by a botanist with a ruler will be less precise than the same measurement taken by micrometer. (Notionally, at least, the leaf could be measured even more precisely using a microscope with a gratitude.)
c.              Interval Scale Variables measured on an interval scale have values in which differences are uniform and meaningful but ratios will not be so. An oft quoted example is that of the Celsius scale of temperature. A difference between 5° and 10° is equivalent to a difference between 10° and 15°, but the ratio between 15° and 5° does not imply that the former is three times as warm as the latter.
d.             Ratio Scale Variables on a ratio scale have a meaningful zero point. In keeping with the above example one might cite the Kelvin temperature scale. Because there is an absolute zero, it is true to say that 400°K is twice as warm as 200°K, though one should do so with tongue in cheek. A better day-to-day example would be to say that a 180 kg Sumo wrestler is three times heavier than his 60 kg wife.
e.              Circular Scale When one measures annual dates, clock times and a few other forms of data, a circular scale is in use. It can happen that neither differences nor ratios of such variables are sensible derivatives, and special methods have to be employed for such data.
C.           Collecting Statistic Data
1.             Principles of Collecting statistic Data
Data collection is the process of gathering and measuring information on variables of interest, in an established systematic fashion that enables one to answer stated research questions, test hypotheses, and evaluate outcomes. The data collection component of research is common to all fields of study including physical and social sciences, humanities, business, etc. While methods vary by discipline, the emphasis on ensuring accurate and honest collection remains the same. The goal for all data collection is to capture quality evidence that then translates to rich data analysis and allows the building of a convincing and credible answer to questions that have been posed.
Regardless of the field of study or preference for defining data (quantitative, qualitative), accurate data collection is essential to maintaining the integrity of research. Both the selection of appropriate data collection instruments (existing, modified, or newly developed) and clearly delineated instructions for their correct use reduce the likelihood of errors occurring.
A formal data collection process is necessary as it ensures that data gathered are both defined and accurate and that subsequent decisions based on arguments embodied in the findings are valid.[2] The process provides both a baseline from which to measure and in certain cases a target on what to improve.
Consequences from improperly collected data include: Generally there are three types of data collection and they are:
1)             Surveys: Standardized paper-and-pencil or phone questionnaires that ask predetermined questions.
2)             Interviews: Structured or unstructured one-on-one directed conversations with key individuals or leaders in a community.
3)             Focus groups: Structured interviews with small groups of like individuals using standardized questions, follow-up questions, and exploration of other topics that arise to better understand participants.
ü   Inability to answer research questions accurately.
ü   Inability to repeat and validate the study.
Distorted findings result in wasted resources and can mislead other researchers to pursue fruitless avenues of investigation. This compromises decisions for public policy, and causes harm to human participants and animal subjects.

2.             The Way of Collecting Statistics data
As we have seen in the definition of statistics, data collection is a fundamental aspect and as a consequence, there are different methods of collecting data which when used on one particular set will result in different kinds of data. Let's move on to look at these individual methods of collection in order to better understand the types of data that will result.
1)             Census Data Collection
Census data collection is a method of collecting data whereby all the data from each and every member of the population is collected.
For example, when you collect the ages of all the students in a given class, you are using the census data collection method since you are including all the members of the population (which is the class in this case).
This method of data collection is very expensive (tedious, time consuming and costly) if the number of elements (population size) is very large. To understand the scope of how expensive it is, think of trying to count all the ten year old boys in the country. That would take a lot of time and resources, which you may not have.
2)             Sample Data Collection
Sample data collection, which is commonly just referred to as sampling that is a method which collects data from only a chosen portion of the population.
Sampling assumes that the portion that is chosen to be sampled is a good estimate of the entire population. Thus one can save resources and time by only collecting data from a small part of the population. But this raises the question of whether sampling is accurate or not. The answer is that for the most part, sampling is approximately accurate. This is only true if you choose your sample carefully to be able to closely approximate what the true population consists of.
Sampling is used commonly in everyday life, for example, all the different research polls that are conducted before elections. Pollsters don't ask all the people in a given state who they'll vote for, but they choose a small sample and assume that these people represent how the entire population of the state is likely to vote. History has shown that these polls are almost always close to accuracy, and as such sampling is a very powerful tool in statistics.
3)             Experimental Data Collection
Experimental data collection involves one performing an experiment and then collecting the data to be further analyzed. Experiments involve tests and the results of these tests are your data.
An example of experimental data collection is rolling a die one hundred times while recording the outcomes. Your data would be the results you get in each roll. The experiment could involve rolling the die in different ways and recording the results for each of those different ways.
Experimental data collection is useful in testing theories and different products and is a very fundamental aspect of mathematics and all science as a whole.
4)             Observational Data Collection
Observational data collection method involves not carrying out an experiment but observing without influencing the population at all. Observational data collection is popular in studying trends and behaviours of society where, for example, the lives of a bunch of people are observed and data is collected for the different aspects of their lives.

Based Pros and Cons, Each method of data collection has advantages and disadvantages.
1)              Resources.
When the population is large, a sample survey has a big resource advantage over a census. A well-designed sample survey can provide very precise estimates of population parameters - quicker, cheaper, and with less manpower than a census.

2)              Generalizability
Generalizability refers to the appropriateness of applying findings from a study to a larger population. Generalizability requires random selection. If participants in a study are randomly selected from a larger population, it is appropriate to generalize study results to the larger population; if not, it is not appropriate to generalize.
Observational studies do not feature random selection; so generalizing from the results of an observational study to a larger population can be a problem.

3)              Causal inference
Cause-and-effect relationships can be teased out when subjects are randomly assigned to groups. Therefore, experiments, which allow the researcher to control assignment of subjects to treatment groups, are the best method for investigating causal relationships.

3.    The tool of Collecting Data
Among the tools that can be used in the educational work of collecting statistical data can be pointed out here for example:
a.       List or check (check list)
b.      A graduated scale (Rating Scale)
c.       Guidelines for the interview (interview gulde)
d.      Questionnaire (list of questions that every question has been resolved in the answer to select or provide a place to fill in the answer[16].
  

REFERENCES

Moses, Lincoln E. (1986) Think and Explain with Statistics, Addison-Wesley, ISBN 978-0-201-15619-5 . pp. 1–3
Dr.H Agus Irianto,(2004)Statistik(Konsep dasar, Aplikasi, dan Pengembangannya), Jakarta, Kencana.
Hays, William Lee, (1973) Statistics for the Social Sciences, Holt, Rinehart and Winston, p.xii, ISBN 978-0-03-077945-9
http://en.wikipedia.org/wiki/Statistics (was taken on december, 04, 2014)
http://arlingsapri.blogspot.com/2014/03/pengertian-statistik-pendidikan.html
Moses, Lincoln E. (1986) Think and Explain with Statistics, Addison-Wesley, ISBN 978-0-201-15619-5 . pp. 1–3
        Dr.H Agus Irianto,(2004)Statistik(Konsep dasar, Aplikasi, dan Pengembangannya), Jakarta, Kencana.
ChairpersonDr. J. Jothikumar,(2005),Statistik Higher secondary-first year,Tamilnadu, Governman
Journal of statistics education, 2014, Statistics Education Web (STEW), North washington, American statistical education.
http://iase-web.org/ Journal International Association for statistical education,
http://yuni-elf.blogspot.com/2012/12/tugas-ii-statistik.html
http://yuni-elf.blogspot.com/2012/12/tugas-ii-statistik.html
Hays, William Lee, (1973) Statistics for the Social Sciences, Holt, Rinehart and Winston, p.xii, ISBN 978-0-03-077945-9







Comments

Popular Posts