Introduction to Biostatistics

Benish Ali
24 Sept 202029:34

Summary

TLDRهذا النص يقدم مقدمة لعلم الاحصائيات الحيوية، يشرح أهمية الاحصائيات في الحياة اليومية وعلم البيولوجيا، ويسلط الضوء على الفرق بين الإحصاء الوصفي والإحصاء ال演绎. يتضمن النص خطوات لتحليل البيانات البيولوجية مثل تحديد السؤال البيولوجية، صياغة الفرضية، تحديد المتغيرات، اختيار الاختبار الإحصائي، إجراء التجربة، تحليل البيانات، وتقديم النتائج. كما يوصح ببعض المصادر التعليمية للتعلم.

Takeaways

  • 📊統計学は生物学や公衆衛生、その他の健康科学分野でデータを適切に解釈するため重要な分野です。
  • 🔢 統計情報は日常生活に広く存在し、人口調査や社会メディアの統計など多种形式で使われています。
  • 📚 統計学は、数量情報の収集とそのデータの処理方法を指し、大規模な集団に関する推測を小さなサンプルに基づいて引き出す方法も含みます。
  • 👨‍🔬 生物統計학は、生物学、公衆衛生、その他の健康科学分野で生成された科学データの適切な解釈を担当する統計学の分野です。
  • 📈 統計学の2つの主要なドメインは記述統計学と推論統計学で、前者は集団から収集したデータを用いてその集団について記述または結論を引き出すのに使用されます。
  • 🔎 推論統計学は、サンプルデータを用いてサンプルが取られた总体に関する結論に達します。
  • 📊 統計学は、数量データの組織、要約、記述の方法と、それらに基づく推論と一般化の方法に関連しています。
  • 👨‍🎓 統計学のリテラシーは、科学文献を読解、理解、評価するために不可欠です。
  • 🔬 自分の研究を実行し、権威的に結果を提示するために生物学者には統計学的リテラシーが必要です。
  • 📈 統計学のステップバイステップアプローチは、実験計画、実験実施、データ分析、結果提示の4つの主要なステップから成ります。
  • 📚 統計学の初心者向けの優れた書籍やオンラインリソース、統計計算パッケージが提案されています。

Q & A

  • ما هو البيان البياني؟

    -البيان البياني هو فرع من العلوم التطبيقية يتعامل مع تفسير البيانات العلمية المنشأة في مجالات مختلفة مثل البيولوجيا والصحة العامة.

  • لماذا من المهم دراسة البيان البيئي؟

    -يتطلب البيان البيئي المعرفة المطلوبة لقراءة وفهم وتقييم الأدب العلمية، بالإضافة إلى أنه يساعد في إجراء الأبحاث بطريقة علمية وتقديم النتائج بطريقة سليمة.

  • ما هي الفرق بين الإحصاء الوصفي والإستنتاجي؟

    -الإحصاء الوصفي يستخدم البيانات الجمع لتوصيف نفس المجموعة، بينما الإحصاء الاستنتاجي يستخدم بيانات العينة لاستنتاجات حول المجموعة الكبيرة.

  • ما هي خطوات تحليل البيانات البيولوجية بطريقة مفصلة؟

    -يتضمن تحليل البيانات البيولوجية خطوات مثل تحديد السؤال البيئي، تخطيط التجربة، إجراء التجربة، تحليل البيانات، وتقديم النتائج.

  • ما هي الدورة التعليمية المطلوبة لفهم البيان البيئي؟

    -يجب أن تتضمن الدورة التعليمية المعرفة الأساسية في البيان والاستنتاجات، وكيفية تطبيقها في تحليل البيانات العلمية.

  • كيف يمكن للطلاب التكنولوجيا المساعدة في تحليل البيانات العلمية؟

    -يمكن للطلاب استخدام الحزم الإلكترونية الحرة للقيام بعمليات التحليل الاحصائي في أبحاثهم، مثل Staff Pages.org و Social Statistics.

  • ما هي الكتب المقترحة للتعلم البياني؟

    -يتضمن الكتب المقترحة لتعلم البيان البياني كتاب "Fowler and Cohen"، وكتاب "McDonald's Handbook of Biological Statistics".

  • كيف يمكن للباحثون التأكد من أن البيانات تلبي متطلبات الاختبار الاحصائي؟

    -يمكن للباحثون التحقق من أن البيانات تلبي متطلبات الاختبار الاحصائي من خلال تحليل البيانات قبل تطبيق الاختبار، واختيار الاختبار المناسب إذا لم تلبي متطلبات الاختبار المختار.

  • ما هي أهمية ترتيب الفرضية الصفرية والفرضية البديلة في الأبحاث العلمية؟

    -تحدد الفرضية الصفرية والفرضية البديلة النتائج المتوقعة وتساعد في التحقق من الفرضيات المختلفة في الأبحاث العلمية.

Outlines

00:00

📊 مقدمة في biostatistics

النص المقدم يتحدث عن الأهمية الأساسية للbiostatistics أو الإحصاء الحيوية في مجال العلوم الحيوية والصحيةة. يبدأ النص بتعريف الbiostatistics، وكيف أن الإحصاء هي جزء من الحياة اليومية للعالميين، وخاصة في الدراسات البيولوجية. يتضمن النص مثالات على الاستخدامات اليومية للإحصاء مثل التقارير السنوية، التعدادات، الدراسات التوزيعية، السجلات الثقافية، والإحصاءات الاجتماعية. يشدد النص على أن الbiostatistics هي جزء من المعرفة المطلوبة للفهم والتقييم للأبحاث العلمية، وأن الbiostatistics تتضمن جمع المعلومات الكمية وطرق التعامل مع هذه المعلومات، بالإضافة إلى تقنيات ال演绎 للمعلومات الكمية.

05:01

🔍 تحليل الbiostatistics

يتحدث النص الثانوي عن الbiostatistics وكيفية تنظيم وتلخيص ووصف البيانات القابلة للقياس، وكذلك الأساليب لرسم ال演绎ات عنها. يوضح النص على أن هناك قسمين رئيسيين في الإحصاء: الإحصاء الوصفي والإحصاء ال演绎ي. يصف الإحصاء الوصفي بكيفية استخدام البيانات الجمعة على مجموعة لوصف أو استنتاج النتائج عن نفس المجموعة، مثل ترتيب العلامات الدراسية. أما الإحصاء ال演绎ي فيمكن استخدام البيانات العينة لستنتاج نتائج حول المجموعة الأصلية، مثل مقارنة الطول المتوسط للطلبة في الجامعة. يشدد النص على أهمية الbiostatistics في مجال العلوم الحيوية والصحيةة.

10:03

📚 الأهمية المطلوب لفهم biostatistics

يناقش النص ال üçüncü السببين الرئيسيين وراء الحاجة إلى biostatistics أو المعرفة الإحصائية لدى عالمي العلوم الحيوية. الأول هو القدرة على قراءة وفهم وتقييم الأدب العلمية، حيث أن الbiostatistics تساعد على فهم البيانات العلمية وتحليلها. الثاني هو القدرة على إجراء الأبحاث الخاصة وتقديم النتائج بطريقة سليمة، حيث أن الbiostatistics تتطلب من الباحثين التخطيط مسبق للطرق الإحصائية المطلوبة قبل إجراء الدراسة. يشدد النص على أن الbiostatistics جزء من خطة الدراسة وليس بعدة الدراسة.

15:03

👨‍🔬 دور الستاتيستicians في الbiostatistics

يتحدث النص الرابع عن دور الستاتيستicians في الbiostatistics، الذي يتضمن توجيه تصميم التجارب أو الدراسات قبل جمع البيانات، وتحليل البيانات باستخدام الإجراءات الإحصائية الصحيحة، وتقديم النتائج المتكاملة للباحثين والقرارة. يشدد النص على أهمية التخطيط الجيد والتفكير مسبقاً في الbiostatistics قبل إجراء الدراسة، وكيفية تقسيم الbiostatistics إلى خطوات مفصلة لتحليل البيانات البيولوجية.

20:04

🔬 خطوات الbiostatistics في تحليل البيانات

يصف النص الخامس خطوات الbiostatistics في تحليل البيانات البيولوجية بشكل مفصل. تبدأ الخطوات بتحديد السؤال البيئي الذي يريد الإجابة عنه، ووضع السؤال في شكل فرضية صفرية وبديلة، وتحديد المتغيرات الهامة، واختيار الاختبار الإحصائي المناسب، وإجراء التجربة، وتحليل البيانات، وعرض النتائج. يشدد النص على أهمية اختيار الاختبار الإحصائي المناسب بناءً على نوع البيانات والمتغيرات، وكيفية تغيير الاختبار إذا احتج الإجراء.

25:04

📈 تجميع النتائج والعرض عنها

يتحدث النص السادس في النهاية عن كيفية تجميع النتائج وعرضها بشكل فعال، باستخدام الرسوم البيانية والجداول. يشدد النص على أن الbiostatistics تساعد في تنظيم المعلومات وعرضها بطريقة مناسبة. يلخص النص الخطوات الهامة للبحث البيئي، بدءاً من تحديد السؤال البيئي، وتخطيط التجربة، وإجراء التجربة، وتحليل النتائج، وعرض النتائج.

Mindmap

Keywords

💡biostatistics

Biostatistics هو فرع من الإحصاء يستخدم في العلوم الحيوية والصحية، ويتعلق بتفسير البيانات العلمية بشكل صحيح. في الفيديو، يستخدم المصطلح لشرح الأهمية الأساسية للاستاتستيكا في تحليل البيانات العلمية، مثل الأبحاث البيولوجية والصحيةة، وكيفية استخدام البيانات الكمية لفهم النماذج واتخاذ القرارات العلمية.

💡statistics

الإحصاء هو علم يستخدم لجمع وترتيب ووصف البيانات الكمية، واستخراج ال演绎ات من البيانات. في الفيديو، يستخدم للإشارة إلى دور الأرقام والبيانات في فهم العالم واتخاذ القرارات، مثل التقرير السنوي والتعدادات والدراسات التوزيعية.

💡descriptive statistics

الإحصاء الوصفي هو فرع من الإحصاء يستخدم لوصف البيانات الكمية بطريقة تسمح بفهمها وتحليلها. في الفيديو، يستخدم للإشارة إلى الطريقة التي يتم بها ترتيب البيانات وبيانها لوصف مجموعة معينة، مثل ترتيب العلامات الدراسية للطلاب.

💡inferential statistics

الإحصاء ال演绎ي هو فرع من الإحصاء يستخدم لاستخراج ال演绎ات عن مجموعة كبيرة بناءً على الملاحظات على مجموعة أصغر. في الفيديو، يستخدم للإشارة إلى الطريقة التي يتم بها جمع البيانات من عينة وتطبيقها على المجموعة الكبيرة، مثل تقدير معدل انتشار الأمراض.

💡null hypothesis

الفرض ال_NULL__ هو فرضية تثبت أنها لا توجد فرق بين المجموعات في الدراسة. في الفيديو، يستخدم للإشارة إلى الطريقة العلمية التقليدية لوضع假设 للبحث، حيث يتم اختبار假设 ال_NULL__ قبل الacceptance أو الrejection.

💡alternate hypothesis

الفرض البديل هو فرضية تثبت أنها يوجد فرق بين المجموعات في الدراسة. في الفيديو، يستخدم للإشارة إلى النتيجة المطلوبة إذا اتضح أن假设 ال_NULL__ غير صحيح، مثل وجود فرق في معدل انتشار الأمراض بين الSmokers والnon-smokers.

💡quantitative information

المعلومات الكمية هي المعلومات التي يمكن م量的ها بأرقام. في الفيديو، يستخدم للإشارة إلى النوع المطلوب من المعلومات لإجراء التحليل الإحصائي، مثل التعدادات والمتوسطات.

💡data collection

جمع البيانات هو عملية جمع المعلومات المطلوبة لإجراء التحليل. في الفيديو، يستخدم للإشارة إلى خطوة مهمة في الresearch حيث يتم جمع المعلومات الكمية من الأفراد، مثل سجل التاريخ التدخيني.

💡statistical test

اختبار الإحصاء هو طريقة تستخدم لتحليل البيانات واستخراج ال演绎ات العلمية. في الفيديو، يستخدم للإشارة إلى الطريقة التي يتم بها اختيار اختبار مناسب بناءً على النوع والحجم للبيانات، مثل اختبار chí-square للمقارنة بين المجموعات.

💡research design

تصميم الresearch هو خطة تحدد الطريقة التي سيتم بها إجراء الresearch. في الفيديو، يستخدم للإشارة إلى الخطوات المطلوبة قبل إجراء الresearch، مثل تحديد السؤال البيئي وتحديد ال假设ات والإحصائيات المطلوبة.

Highlights

Introduction to biostatistics and its importance in daily life and scientific research.

Definition of statistics as a part of life for biologists and its application in various fields.

Examples of statistical information in daily life such as annual reports, censuses, and social media statistics.

The role of statistics in making informed decisions based on customer feedback and prevalence reports.

Clarification that statistics is not alien or purely mathematical, but already in use in daily life.

Statistics defined as the collection of quantitative information and methods of handling data.

Explanation of how statistics is used to draw inferences about large groups from smaller observations.

The two main parts of statistics: descriptive and inferential statistics.

Descriptive statistics focuses on organizing and summarizing data from a specific group.

Inferential statistics uses sample data to make conclusions about a larger population.

Biostatistics defined as the branch of statistics for interpreting scientific data in biology and health sciences.

The necessity of statistical literacy for reading, understanding, and evaluating scientific literature.

The importance of statistical literacy for biologists to undertake investigations and present results authoritatively.

The role of statisticians in guiding the design of experiments and analyzing data.

A step-by-step approach to analyzing biological data, emphasizing the planning and execution of research.

The process of specifying a biological question, formulating hypotheses, defining variables, and choosing statistical tests.

The flexibility to modify statistical tests based on data collected during experiments.

The final steps of applying statistical tests, integrating results, and effectively communicating them.

Learning resources suggested for the course, including books and online statistical calculation packages.

Transcripts

play00:02

okay our first chapter is

play00:03

introduction to biostatistics so in this

play00:06

chapter we are going to

play00:07

see that what is biostatistics what is

play00:11

statistics

play00:12

and uh why it is important for you to

play00:14

study this

play00:15

course

play00:18

so let's start from the definition of

play00:20

statistics

play00:22

and statistic is not something alien and

play00:24

statistics is part of life of every

play00:26

serious

play00:27

biologist and not just biologists it is

play00:30

part of everyday

play00:31

life so on everyday basis we have

play00:34

statistical information around

play00:36

us in the form of annual reports

play00:39

various censuses distribution surveys

play00:42

museum records

play00:43

or social media statistics so these are

play00:45

the examples

play00:46

in which we are going to use some sort

play00:48

of statistical information in our daily

play00:51

life

play00:52

and will report about certain uh data

play00:55

about certain

play00:56

records about certain companies for

play00:58

example

play00:59

censuses the population censuses for

play01:02

example

play01:02

distribution surveys so make you make

play01:05

use of information from

play01:07

uh different distribution surveys then

play01:09

museum records they're also a part of

play01:11

statistical

play01:12

information and then the social media

play01:14

statistics like

play01:16

what are the user ratings for a specific

play01:19

app

play01:19

or uh what is the popularity of certain

play01:23

type of post or everything so this is

play01:24

all statistical information that you

play01:26

have in your

play01:27

daily life so it is impossible to

play01:30

imagine life without some sort of

play01:32

statistical information being readily at

play01:34

hand for example if you are going to

play01:36

purchase something you are obviously

play01:38

going to look for a survey that whether

play01:42

this

play01:42

uh what is the feedback of the customers

play01:46

that have already used that product

play01:48

and uh or what is the user rating for

play01:52

that product and where this product is

play01:54

available

play01:55

and all of these information and

play01:58

even when you want to see that if there

play02:01

is

play02:02

a disease which is prevalent in our

play02:03

society or no you're going to

play02:05

see different surveys or the reports of

play02:08

the prevalence reports or the incidence

play02:10

reports

play02:10

so in daily life you're going to use

play02:12

some sort of statistical information so

play02:14

statistics is not something alien and

play02:16

there is no need to be scared of because

play02:18

i know the biology

play02:19

students they're always scared of

play02:20

statistics that it is something alien or

play02:23

it is something mathematical purely

play02:25

and why we are studying this subject so

play02:27

no this is

play02:28

that you are using statistics already in

play02:31

your daily life

play02:35

so the word statistics it is used in two

play02:37

senses

play02:39

it refers to collection of quantitative

play02:41

information

play02:42

and methods of handling that sort of

play02:44

data for example animal reports and

play02:46

censuses

play02:47

so this is one way of the use of word

play02:50

statistics

play02:51

that it refers to collection of

play02:53

quantitative information right so

play02:55

quantitative information means the

play02:56

information which can be presented in

play02:58

numbers in quantities

play02:59

right so this is important that in

play03:01

statistics you're going to make use of

play03:03

the information which is quantitative

play03:06

so it refers to collection of

play03:07

quantitative information that you are

play03:08

collecting the information

play03:10

and as well as the methods of handling

play03:13

that sort of that data that you have

play03:14

just collected the information

play03:16

and how you're going to handle that data

play03:17

how you're going to arrange

play03:19

that data how you're going to describe

play03:21

that data

play03:22

so for example admin reports and

play03:24

censuses so in annual reports also and

play03:26

censuses also you are actually

play03:28

collecting

play03:28

the quantitative information and then

play03:30

you're handling that you are arranging

play03:32

that information in the form of tables

play03:34

or graphs or

play03:35

percentages so this is statistics

play03:39

another way to use statistics is uh to

play03:42

draw

play03:42

inferences about large groups on the

play03:45

basis of observations made on smaller

play03:47

ones right so you have

play03:48

for example a very large population and

play03:51

you want to have some information about

play03:53

that population so how you're going to

play03:54

do that are you going to collect

play03:56

quantitative information from each and

play03:58

every member of the population

play04:00

or are you going to collect information

play04:02

from a smaller number which is a sample

play04:04

and then you are going to make

play04:06

inferences from that sample and then

play04:08

generalizing those inferences on the

play04:10

larger group

play04:11

for example estimating disease

play04:12

prevalence rates

play04:14

so um every time you would have heard

play04:16

that every 10 person in pakistan is

play04:18

suffering from diabetes or every fifth

play04:20

person is suffering from cardiovascular

play04:22

disease and so on

play04:23

so has anyone actually contacted every

play04:26

one of you

play04:26

so i think most of you have never been

play04:29

contacted for

play04:30

those surveys you have never been asked

play04:33

that whether you're suffering from

play04:34

cardiovascular diseases you are

play04:35

suffering from diabetes you are

play04:37

suffering from

play04:38

hepatitis or no but still that data is

play04:40

present for your population

play04:42

in which you are included so how this

play04:44

data is coming

play04:46

so it happens that this information is

play04:48

collected from a smaller group

play04:50

and then that is generalized over the

play04:52

entire population

play04:53

so this is also statistics so usually

play04:56

we can say that word statistics it not

play04:59

just includes the collection of

play05:01

quantitative information

play05:02

but it also refers to drawing inferences

play05:05

from that quantitative information

play05:10

so statistics then is to do with ways of

play05:12

organizing

play05:13

summarizing and describing quantifiable

play05:16

data

play05:17

and methods of drawing inferences and

play05:19

generalizing upon them

play05:21

so we can see that there there are two

play05:24

aspects of this the first one is that it

play05:26

deals with ways of organizing

play05:29

summarizing and describing quantifiable

play05:32

data so we have already discussed that

play05:35

data should be quantifiable

play05:36

that we need data we need information

play05:40

which is in form of quantities or

play05:41

numbers

play05:42

so statistics deals with ways of

play05:44

organizing summarizing and describing

play05:46

this quantifiable data

play05:48

and this part of the statistics is known

play05:50

as descriptive statistics in which

play05:52

you're actually describing

play05:54

that quantitative information that you

play05:56

have collected

play05:58

then in the definition you can also see

play06:00

that there is another part

play06:02

which is methods of drawing inferences

play06:04

and generalizing

play06:05

upon them so what is this method for

play06:08

right so in this case what you're doing

play06:11

is that you're collecting quantitative

play06:12

information

play06:13

and you are drawing inferences from that

play06:15

quantitative information

play06:17

and then you are generalizing that

play06:18

information those inferences upon a

play06:20

larger group

play06:22

and this part of the statistics is known

play06:24

as inferential statistics

play06:26

right so these are two main parts

play06:29

descriptive statistics and inferential

play06:31

statistics so these are two main domains

play06:33

of statistics

play06:36

so descriptive statistic statistics is

play06:39

using data gathered on a group to

play06:40

describe or reach conclusions about that

play06:43

same group only

play06:44

so this is our descriptive statistics so

play06:46

this is another way of

play06:48

defining these two domains of statistics

play06:50

that

play06:51

descriptive statistics is that in which

play06:53

you are using data which is gathered on

play06:55

a group to describe or reach to

play06:58

conclusions about

play06:59

that same group only so you collect

play07:01

information

play07:02

from one group and then you

play07:06

arrange that information and you draw

play07:08

certain inferences from that information

play07:10

and you apply them only

play07:11

only onto the same group so this is

play07:14

descriptive statistics

play07:15

for example you collect information from

play07:17

students

play07:18

of a class right and you

play07:21

collect information about their grades

play07:24

and what you do is you just collect that

play07:26

information

play07:27

and you arrange that information

play07:29

according to the grades that how many

play07:30

students

play07:31

scored a plus how many students scored a

play07:33

how many

play07:35

b or b plus and c and so on so you

play07:37

collected that information

play07:39

and you applied that information only

play07:40

onto that group

play07:42

that this class the bs6 semester if this

play07:44

class is

play07:45

having uh 50 percent students that

play07:49

have b or c grades and there are 20

play07:52

person students that have eight grades

play07:54

and there are 10 percent

play07:55

or 20 percent students that have a plus

play07:57

grades so this is how

play07:59

that you're gathering information from

play08:00

one group and you're

play08:02

using that information for that same

play08:04

group so you're not going to generalize

play08:06

that information about entire campus

play08:08

upon entire university

play08:09

right so this is descriptive statistics

play08:11

in which you are describing the

play08:12

information only for that

play08:14

group and the other part of this the the

play08:17

other domain of the statistics which is

play08:19

inferential statistics

play08:20

is that you use sample data to reach

play08:22

conclusions about the population from

play08:24

which the sample was

play08:25

taken so here we have a group which is a

play08:28

sample we collect information from that

play08:30

group

play08:31

and we apply that information onto the

play08:33

larger group onto the population from

play08:35

which that group was

play08:36

taken so again we can take the example

play08:39

of university students

play08:42

that we want to know the average height

play08:44

of students

play08:45

and let's suppose there are 2000

play08:47

students in the university so is it

play08:49

possible to contact each and every one

play08:50

of them and collect the information

play08:52

about their height or measure their

play08:53

height

play08:54

no so it is going to be very difficult

play08:56

so what we do is that

play08:58

we collect information from let's say

play09:00

200 students we measure their heights

play09:02

and we

play09:03

collect that we randomly select those

play09:05

students and take their heights and

play09:06

we record that data so now we are going

play09:09

to take the data

play09:11

and we are going to generalize the data

play09:13

onto the entire campus

play09:14

so in this way we are taking information

play09:17

from a smaller

play09:18

sub group of the larger group and then

play09:20

we are applying that information onto

play09:23

the entire group so this is the

play09:25

difference between descriptive and

play09:27

inferential

play09:27

statistics

play09:31

so now what is biostatistics so we have

play09:34

studied the definition of statistics

play09:36

that and the two main domains of

play09:37

statistics so now is the question that

play09:39

what is

play09:40

biostatistics so biostatistics is the

play09:43

branch of statistics

play09:45

which is responsible for the proper

play09:47

interpretation of scientific

play09:48

data generated in different fields and

play09:51

what fields

play09:52

the field of biology public health and

play09:55

other health sciences that is biomedical

play09:57

sciences right

play09:59

so all the data which is

play10:02

used in these sciences all the research

play10:05

which is done in these sciences

play10:07

and analyzed through statistical

play10:09

procedures so that comes under the

play10:11

umbrella of

play10:12

biostatistics

play10:16

so why there is a need to of

play10:18

biostatistics so why you are studying

play10:20

this subject

play10:21

right so why is statistics necessary why

play10:24

do you need to have statistical literacy

play10:26

there are two main reasons for this the

play10:28

first one is

play10:30

that statistical literacy is necessary

play10:32

to read

play10:33

understand and evaluate the scientific

play10:34

literature because you're a biology

play10:36

student and you're studying books you

play10:38

are studying scientific books

play10:40

so you need to have a statistical

play10:42

literacy in order to understand that

play10:44

information for example you

play10:48

you encounter a statement like the

play10:50

probability that first year bird

play10:52

will be found in the north sea is

play10:54

significantly greater than for an

play10:55

older one chi-square 4.2 degree of

play10:58

freedom 1

play10:59

probability less than 0.05

play11:02

so how you are going to evaluate this

play11:06

statement unless and until you know that

play11:08

what is meant by this

play11:10

chi-square value what is meant by its

play11:12

degree of freedom

play11:14

and what is meant by the p value which

play11:16

is the probability value

play11:18

so this statement is that probability

play11:21

that a first year bird

play11:23

will be found in the north sea is

play11:25

significantly greater than for an older

play11:26

one so in this case what others want to

play11:29

say is

play11:30

that they have evaluated or they have

play11:33

surveyed

play11:34

the population of birds that is found in

play11:37

the north sea

play11:39

and they were interested in finding out

play11:41

that whether the first year words

play11:43

are common or the older ones are common

play11:46

so they found out

play11:47

that the first-year words are more

play11:49

common as compared to the older ones and

play11:51

this is how they have described their

play11:53

statements

play11:54

so this is a simple statement so you

play11:56

cannot understand or evaluate

play11:57

this statement and unless and until you

play12:00

can see that what kind of the test

play12:02

is used and what kind of information is

play12:04

given so here you can see

play12:06

that there is chi square and guys care

play12:08

is used for the analysis of frequencies

play12:10

so it means that they have reached to

play12:12

this conclusion

play12:14

based on the comparison of frequencies

play12:16

so what they did is they counted the

play12:18

number of the first year birds and they

play12:20

counted the number of the older birds

play12:22

and then they compared their frequencies

play12:24

and by that comparison they came to know

play12:26

that

play12:27

uh their frequencies are different and

play12:30

the first year words are more

play12:32

frequent as compared to the older

play12:35

words so you can understand these kind

play12:38

of statements in your literature only if

play12:40

you have basic statistical

play12:41

literacy so it requires statistical

play12:44

understanding to evaluate such type of

play12:46

claims which you encounter in your

play12:48

scientific

play12:49

literature so this is why the

play12:51

statistical literacy is important

play12:53

that it is important for you to read

play12:55

understand and evaluate the scientific

play12:57

literature so this is the first reason

play12:59

that you need to have some

play13:01

statistical literacy

play13:04

the second reason is that statistical

play13:06

literacy is important to biologists

play13:08

to undertake an investigation on their

play13:10

own account and to present their results

play13:13

in an authoritative

play13:14

form so if you are a biologist and you

play13:17

are doing your own research

play13:18

so in the previous example we saw that

play13:20

when you're studying the scientific

play13:22

literature you're studying others

play13:24

researches and you're trying to

play13:25

understand those researches

play13:27

but when you are going to undertake your

play13:29

own research you are going to perform

play13:31

your own investigation

play13:32

so again you're going to need

play13:34

statistical literacy

play13:37

so a research program must be planned by

play13:40

anticipating the statistical methods to

play13:42

be employed so this is the next concept

play13:44

that

play13:45

whenever you're going to do a research

play13:47

program

play13:48

so it must be planned by anticipating

play13:50

the statistical methods to be

play13:52

employed so this is why it is necessary

play13:54

for you when you are going to carry out

play13:56

your own

play13:57

research so you must know that what kind

play14:00

of the statistical methods you are going

play14:02

to

play14:03

employ you're going to apply on your

play14:05

research

play14:07

and using the statistical treatment as

play14:10

an afterthought to make the results

play14:12

acceptable is not appropriate so

play14:14

whenever you're going to

play14:15

present your results to the scientific

play14:16

community into the scientific papers or

play14:18

scientific conferences

play14:20

so they're going to ask you that how you

play14:22

are going to prove

play14:24

your results what is the um

play14:27

validity what is the statistical

play14:29

authority of these claims

play14:31

so you have to do the statistical

play14:32

analysis every time you're going to

play14:34

present your research or you're going to

play14:36

present your results

play14:38

so for that purpose you need to have

play14:39

statistical literacy and

play14:41

um you need to have um

play14:44

knowledge about statistical literacy

play14:46

well well before

play14:48

uh applying your research right so

play14:51

uh if whenever you're planning your

play14:54

research you must

play14:55

know of the green statistical technique

play14:58

that you are going to employ

play15:00

otherwise if you do the experiment you

play15:03

collect your results and then you think

play15:04

okay so now i have the data now i have

play15:06

the results and

play15:08

what kind of test can i do no that is

play15:10

not an appropriate you can do that but

play15:11

that

play15:12

is not an appropriate way of doing these

play15:14

scientific

play15:15

experiments so whenever you are going to

play15:17

plan an experiment

play15:18

you have to um

play15:21

you have to think of or you have to

play15:22

mention the statistical approach that

play15:25

you are going to use the statistical

play15:26

test or the statistical tool

play15:28

you are going to use for the analysis of

play15:32

your

play15:32

results so what is the role of

play15:36

statisticians

play15:37

so role of statisticians is to guide the

play15:39

design of an experiment or survey prior

play15:42

to data collection again we are talking

play15:43

about

play15:44

free thought that statistics is part of

play15:46

the planning of experimenters not

play15:48

an afterthought so statisticians they

play15:51

are going to

play15:53

guide biologists if they don't have

play15:55

complete statistical literacy so they

play15:57

are going to guide them

play15:58

about the design of an experimental

play16:00

survey prior to data collection

play16:02

and to analyze data using proper

play16:04

statistical procedures and techniques

play16:07

and to present an integrated results to

play16:09

researchers and other decision

play16:11

makers so these are the role of

play16:13

statisticians

play16:14

so they design they they provide a

play16:17

guidance for the designing of an

play16:18

experiment

play16:19

and then when the experiment has been

play16:21

done our survey has been completed

play16:24

then they analyze that data they can

play16:26

analyze those results

play16:28

and then they can present and interpret

play16:30

the results to those researchers and to

play16:33

the decision makers that what is the

play16:35

interpretation of these

play16:37

results so these are the rules of

play16:39

statisticians

play16:42

now we move on to a step-by-step

play16:44

analysis of biological data that how we

play16:47

can

play16:48

we can adopt an approach which is

play16:51

scientifically

play16:53

visible and scientifically valid to do

play16:56

the research

play16:57

so a systematic step-by-step approach is

play16:59

best way to analyze biological data so

play17:01

you are into the sixth semester and in

play17:03

the next year you're going to do your

play17:05

own research

play17:06

so you must have information that how

play17:08

you're going to

play17:09

design your experiment and how you're

play17:11

going to do your

play17:12

research so it is always step by step

play17:15

you cannot start anything at random or

play17:17

haphazardly

play17:18

you have to organize yourself you have

play17:20

to organize a research

play17:22

plan and then you have to follow those

play17:24

steps

play17:27

so the statistical analysis of

play17:28

biological problem it may be broken down

play17:30

into following steps number one

play17:33

to specify the biological question to be

play17:35

answered

play17:36

so first of all you should be clear that

play17:38

what kind of the research are you going

play17:40

to do

play17:40

what is your biological question that

play17:43

you're going to

play17:44

answer so you must be clear about that

play17:46

biological

play17:47

question that you're going to answer in

play17:49

your research so you have to specify

play17:51

that biological question

play17:54

then you have to put the question in the

play17:56

form of a biological null hypothesis and

play17:58

alternate hypothesis

play18:00

right so you are going to you have a

play18:03

biological question and you are going to

play18:05

explain

play18:06

it and you are going to find out the

play18:07

answer for that biological question

play18:09

and for that purpose the scientific

play18:12

method as you must have stated in your

play18:13

previous classes

play18:15

is that for an exp for as

play18:18

for doing the research for addressing a

play18:20

biological question you have to make

play18:22

hypothesis

play18:23

hypothesis is an assumption for that

play18:27

question a likely answer for that

play18:29

question so you make an hypothesis and

play18:30

you test that hypothesis that whether

play18:32

that hypothesis is true or

play18:34

no so you have to put your question in

play18:37

the form of a biological null hypothesis

play18:39

and alternate hypothesis

play18:41

and then you have to put your question

play18:43

in the form of a statistical null

play18:45

hypothesis and alternate hypothesis

play18:47

because once you are done with testing

play18:49

your biological hypothesis

play18:51

you want to see that whether your

play18:53

biological hypothesis should be accepted

play18:55

or not accepted and for that purpose

play18:56

you're going to need statistics

play18:59

so you're going to buy statistical tools

play19:01

and for that purpose you need to

play19:02

formulate

play19:03

the statistical null hypothesis and

play19:05

alternate

play19:08

hypothesis

play19:11

so now the fourth step is to determine

play19:13

which variables are relevant to the

play19:15

question

play19:15

so in the first step you have specified

play19:18

the biological question then in the

play19:19

second step you have specified

play19:21

the biological hypothesis and in the

play19:23

third step you have specified the

play19:25

statistical hypothesis

play19:26

right so in the fourth step you are

play19:28

going to define the variables which you

play19:30

can

play19:31

use for the collection of your

play19:32

information so that you can address your

play19:34

biological

play19:35

question so let's suppose your

play19:37

biological question is that

play19:39

whether smoking increases the chances of

play19:41

lung cancer or

play19:42

no so let's say this is your biological

play19:45

question

play19:46

now to answer that question you have to

play19:48

make biological null hypothesis an

play19:50

alternate hypothesis

play19:51

so we are going to make the biological

play19:53

hypothesis in a way

play19:55

for example you can say that smoking

play19:58

increases the chances of lung cancer or

play20:01

smoking increases the susceptibility of

play20:03

lung cancer

play20:05

and in the statistical hypothesis you

play20:08

can have your statistical null and

play20:10

alternate hypothesis in the statistical

play20:12

null hypothesis you can say

play20:14

that the prevalence of lung cancer is

play20:16

same

play20:17

in smokers and non-smokers and

play20:20

the alternate hypothesis could be that

play20:22

the prevalence of lung cancer is

play20:23

different among smokers and

play20:25

non-smokers so this is how you're going

play20:27

to state your hypothesis

play20:29

then you have to determine that the

play20:31

variables which you can measure

play20:33

to check your hypothesis that whether

play20:35

your hypothesis is true or

play20:37

no so for that purpose what kind of

play20:39

variable are you going to use

play20:41

you are obviously going to look for the

play20:44

smoking history of the

play20:45

individuals suffering from lung cancer

play20:47

individuals are not suffering from the

play20:48

lung cancer

play20:50

so in that case you are going to be

play20:52

individuals and you are going to ask

play20:54

them questions about

play20:55

their smoking history and then

play20:58

you can ask them about their

play21:02

smoking behavior that whether they're

play21:04

smokers or non-smokers

play21:05

or they're ex-smokers and if they're

play21:08

current smokers then

play21:09

how many cigarettes a day they smoke so

play21:12

so you make these variable

play21:14

and then you collect information about

play21:16

those variables from the individuals

play21:19

so this is how you define the variable

play21:20

so once you have information about their

play21:22

smoking history only then you can test

play21:24

your hypothesis

play21:25

so this is what is meant by defining the

play21:27

variables that what kind of information

play21:29

you are going to collect from those

play21:31

individuals right so

play21:34

the next step is to determine what kind

play21:36

of variable each one is that whether you

play21:38

have a quantitative variable or you have

play21:40

a quantitative variable

play21:42

and because we are talking about

play21:43

statistics so we need to have

play21:45

quantitative information we have you

play21:47

need to have quantitative

play21:48

variable which can be presented in

play21:50

numbers so again

play21:52

in that case you're going to see that

play21:54

whether you have the

play21:55

um ordinal variable or you have a

play21:58

continuous variable or you have a

play22:00

discontinuous variable

play22:02

so you don't worry about this definition

play22:03

we are going to check these definitions

play22:06

in

play22:06

the coming chapters so right now i'm

play22:09

just

play22:09

going to describe you that what is meant

play22:11

by the kind of variables so there could

play22:13

be different kinds of variables

play22:15

there could be variables which are

play22:16

measured on a discontinuous scale there

play22:18

are variables which could be measured on

play22:20

a continuous scale

play22:21

there could be variables which are the

play22:25

measurement of something there could be

play22:27

variables which are the counting of

play22:28

something

play22:30

there could be uh variables which are

play22:32

measured on

play22:33

an ordinal scale on a ratio scale or on

play22:36

an interval scale

play22:37

so we are going to define that what type

play22:39

of variable is this one

play22:42

then in the sixth step uh based on the

play22:45

number of variables the kind of

play22:47

variables and the hypothesis to be

play22:48

tested then you have to choose the best

play22:50

statistical test

play22:51

to use there are a number of statistical

play22:53

tests that are available but not

play22:55

everyone is suitable for

play22:57

the kind of variable that you have the

play22:59

kind of data you have

play23:01

so you have to find out that which kind

play23:03

of statistical test is going to be good

play23:05

for your data for example we were

play23:06

talking about the

play23:08

relationship between smoking history and

play23:11

the prevalence of lung cancer

play23:13

so for that kind of test we can um

play23:16

for that kind of analysis we can choose

play23:18

the chi-square test of independence and

play23:20

we can just check that

play23:22

what is the differences in their

play23:24

frequencies so again it depends on the

play23:26

kind of the

play23:27

variables and the kind of your problem

play23:29

and the kind of your hypothesis that

play23:31

what kind of statistical test you are

play23:33

going to

play23:34

choose now do the experiment now you see

play23:38

that this is the time to do the

play23:40

experiment so before that you have to

play23:41

plan all these things you have to plan

play23:43

your question you have to plan

play23:45

you have to make or design specific

play23:48

hypothesis

play23:49

and then you have to define that which

play23:51

variables you are going to collect your

play23:53

during your data collection or doing

play23:55

your experiments

play23:57

and then you also have to define that

play23:59

what kind of the statistical tests you

play24:01

are going to employ

play24:02

and only then you're going to do the

play24:04

experiment right

play24:06

so that was all pre-planning and

play24:08

statistics was the part of that planning

play24:11

of your experiment

play24:13

so now you do the experiment

play24:16

and now you have done the experiment or

play24:19

you have collected your

play24:20

observations or informations now you

play24:22

have to examine the data to see if it

play24:24

needs the assumptions of the statistical

play24:26

test you

play24:27

chose if it doesn't you should choose a

play24:30

more appropriate test

play24:31

for example uh you are studying the

play24:34

relationship between smoking and

play24:36

the susceptibility of lung cancer

play24:39

right so you define your variable that

play24:42

you are going to

play24:43

take the smoking history from the

play24:46

individuals who are suffering from lung

play24:48

cancer and the religious were not

play24:49

suffering from the lung cancer and then

play24:51

you are going to compare them

play24:52

through the guys care test of

play24:54

independence right

play24:56

but when you were doing uh the data

play24:58

collection when you were

play25:00

surveying those individuals you came to

play25:02

know that

play25:03

there is another factor that can

play25:05

contribute to that

play25:07

susceptibility and that factor is let's

play25:09

say age

play25:10

that the lung cancer is more common in

play25:12

older individuals as compared to the

play25:14

younger individuals

play25:15

so in that case we're going to add

play25:17

another variable

play25:19

which could be a contributing factor in

play25:21

susceptibility as

play25:23

well right so in that case what you're

play25:26

going to do is

play25:27

that if your uh the statistical that

play25:30

test that you chose

play25:31

whether that test is allowing you the

play25:34

comparison of multiple

play25:35

variables or is it not allowing you the

play25:38

comparison of multiple variables so if

play25:40

it does then you can use the same test

play25:42

and if it doesn't you can choose a more

play25:44

appropriate

play25:45

test right so it is flexible

play25:48

that you have to include the information

play25:51

about statistical tests into your

play25:53

planning

play25:54

but if due to some unavoidable reasons

play25:56

if

play25:57

due to some um other unforcing reasons

play26:01

you have to add or remove the variables

play26:03

then you have to modify your statistical

play26:05

test as

play26:06

well and you can do that

play26:09

now you can apply your chosen

play26:11

statistical test and then you can

play26:13

integrate your

play26:14

results and then once you have

play26:18

reached your results then it is the time

play26:20

to communicate your results effectively

play26:22

and usually with a graph or table and

play26:24

again statistics is going to help you

play26:26

and that that how you are going to

play26:28

organize so remember the definition

play26:29

statistics

play26:30

that it deals with the ways of

play26:32

collecting organizing

play26:34

and describing the quantitative

play26:36

information

play26:37

so statistics helps you in collecting

play26:40

the information

play26:41

and organizing that information and it

play26:43

also helps you in the

play26:45

representation of that information that

play26:47

how you're going to present that

play26:48

information

play26:49

what kind of the tables are suitable

play26:51

what kind of the graphs are suitable for

play26:53

presenting your

play26:54

data so a quick review of

play26:58

the step-by-step approach to the

play27:00

analysis of biological data

play27:02

so that you have a biological question

play27:03

that you want to answer

play27:05

you plan an experiment and the planning

play27:06

of experiment involves uh formulating

play27:09

the biological hypothesis formulating

play27:11

the statistical hypothesis

play27:12

defining the variables choosing the

play27:14

statistical tests

play27:16

and then you do the experiment so after

play27:18

planning you do the experiment and you

play27:20

get your

play27:21

data you get your information so what is

play27:23

the next step you're going to analyze

play27:25

that data

play27:26

and again you're going to make use of

play27:28

statistics for analyzing your data and

play27:30

getting your results

play27:31

and then the next step is to present

play27:33

your results so this is the summary of

play27:35

um of the steps that you can adopt

play27:39

to carry out your research that you have

play27:42

to first identify your biological

play27:43

question

play27:44

then you have to plan your experiment

play27:46

then you do the experiment

play27:48

and you analyze your results and then

play27:49

you present your results

play27:51

right so this is how you do it so this

play27:54

is the summary of

play27:55

the research design or the research

play27:58

planning

play27:59

that you can employ

play28:03

and now the learning resources for your

play28:06

course

play28:07

so um i'm going to suggest a couple of

play28:10

books

play28:11

the first one is fowler and cohen this

play28:13

is a very good book and very excellent

play28:15

book for the beginners who don't have

play28:19

background of statistics as of as

play28:22

discipline or as a subject just like the

play28:24

biologist so we don't have a background

play28:27

of

play28:27

statistics so this book is very much

play28:30

relevant for us

play28:31

so this is a guide for practical

play28:33

statistics for field biology

play28:35

and it addresses the basic questions and

play28:37

it gives the

play28:39

examples from biology so most of the

play28:40

content of this course that i am going

play28:42

to discuss with you comes from this book

play28:44

so this is a very good book and then

play28:47

there is a book which is available

play28:48

online

play28:49

and you can download the pdf or from the

play28:52

link which is given here

play28:54

and this book is by mcdonald and this is

play28:56

handbook of

play28:58

biological statistics so again this is a

play29:00

very basic book

play29:01

and very easy to understand and you can

play29:04

just download

play29:05

the pdf version of this book from this

play29:08

link

play29:10

and then there are some online

play29:12

statistical calculation packages that

play29:14

you can use they're free to use

play29:15

and you can use them for statistical

play29:17

analysis in your research when you're

play29:19

going to do research in your next

play29:21

semester

play29:22

staff pages.org social statistics

play29:25

statistics.com or vesserstats.net

play29:28

right so these are some of the online

play29:29

statistical calculation packages that

play29:31

you can

play29:32

use

Rate This

5.0 / 5 (0 votes)

Étiquettes Connexes
إحصاء حيويعلم الALTHال生物科技البيانات الكميةتحليل البياناتال_NULL假设ال实验中学لالتحليل البيانيالاستنتاج ال演绎التعلم التفاعلي
Besoin d'un résumé en anglais ?