Dear Ali,
Thank you for your message. I enjoyed reading your analysis, and I suspect we agree on most things. My apologies for being slow to reply.
I very much agree with you that, in the general picture of statistical prediction, better results are gained with a larger number of data points - i.e. with large N rather than small N.
My interest, however, is in identifying situations where this does not apply. In part this is useful because it identifies structural weaknesses in statistical analyses, and in part because it offers opportunities for simpler and better analyses.
An example: during the 1936 US presidential election, the Literary Digest polled the US population and gained 2.4 million returns, which on statistical analysis predicted Landon would become president with 55% of the vote against 41% for his main rival. In fact, Roosevelt won with 61% of the vote to Landon's 37% - as predicted by Gallup using only 3,000 returns. Participant selection bias mattered more than having an N that was 800 times larger. (Harford, T. (2014) Big Mistake. Financial Times, 29 March, p. 18).
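The Digest story can be sketched as a toy simulation. The inclusion probabilities below are illustrative assumptions of mine, not the Digest's actual sampling frame; the point is only that a frame which under-represents one candidate's supporters swamps any benefit from an enormous N.

```python
import random

random.seed(1)

# Illustrative population: 61% support candidate A (Roosevelt's actual share).
population_p = 0.61

def unbiased_sample(n):
    # Every voter equally likely to be polled.
    return sum(random.random() < population_p for _ in range(n)) / n

def biased_sample(n, reach=0.25):
    # Hypothetical frame bias: A-supporters have only a 25% chance of being
    # in the sampling frame (e.g. car/telephone ownership lists), while
    # B-supporters are always in it. P(A | sampled) follows from Bayes' rule.
    p = (population_p * reach) / (population_p * reach + (1 - population_p))
    return sum(random.random() < p for _ in range(n)) / n

print(f"Unbiased, N=3,000:     {unbiased_sample(3_000):.3f}")
print(f"Biased,   N=2,400,000: {biased_sample(2_400_000):.3f}")
```

With these made-up inclusion rates, the small unbiased poll lands close to the true 0.61, while the huge biased poll converges confidently on the wrong answer.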
Harford also discusses Ioannidis' work on why most statistically-based research findings are false (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1182327/). Many large N statistical research projects of the sort of interest in design research suffer from the problems Ioannidis identified.
On the other hand, I think it is worth exploring how to get valid predictive outcomes from low N statistical research. The way I see it, in terms of low and high N, the quality of statistical prediction depends on four elements:
1. The kind of behaviour of the phenomenon to which the statistical analysis applies;
2. Information known about the details of the behaviour of the phenomenon and its context;
3. The quality of empirical data about the phenomenon and context (i.e. the quality, relevance, accuracy etc. of each data point); and,
4. The amount of empirical data about the phenomenon (the size of N).
I suggest 1, 2 and 3 balance against 4. That is, better information on 1, 2 and 3 means less need for a large N.
A thought experiment, using a very simple statistical analysis: imagine a stationary sealed container with a fluid inside it, and you want to know the depth of the fluid. How many data points do you need? One.
Of course, if the container is being continually shaken, then you need more data points. How many more you need, however, also depends on 1 and 2 (knowledge of the behaviour of the fluid and of the context). A thick viscous fluid and minimal shaking will require a low N. In contrast, a thin fluid and lots of irregular shaking will require a larger N. Interestingly, meta-analysis of the figures from the data points will also inform 1, 2 and 3, and if done well that may help reduce the N needed for sound prediction.
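Under a standard simplifying assumption (each depth reading scatters around the true depth with a spread sigma set by the fluid and the shaking), the required N can be sketched from the usual confidence-interval formula. This is a minimal illustration of the trade-off, not a model of real fluid dynamics, and the numbers are invented:

```python
import math
import random

random.seed(0)
TRUE_DEPTH = 50.0  # cm, illustrative

def n_needed(sigma, tolerance=0.5, z=1.96):
    # Samples needed so the 95% CI half-width on the mean is <= tolerance:
    # n >= (z * sigma / tolerance)^2
    return math.ceil((z * sigma / tolerance) ** 2)

def estimate_depth(sigma, n):
    # Each reading = true depth + surface disturbance from the shaking.
    readings = [random.gauss(TRUE_DEPTH, sigma) for _ in range(n)]
    return sum(readings) / n

for label, sigma in [("still (sealed, stationary)", 0.0),
                     ("viscous fluid, gentle shaking", 0.5),
                     ("thin fluid, vigorous shaking", 5.0)]:
    n = max(1, n_needed(sigma))
    print(f"{label}: sigma={sigma} -> N={n}, estimate={estimate_depth(sigma, n):.2f}")
```

The required N grows with the square of the disturbance: the still container needs one reading, the viscous case a handful, and the vigorously shaken thin fluid a few hundred. Better knowledge of sigma (elements 1-3) directly reduces the N demanded by element 4.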
Alternatively, the problem can be seen as one of curve fitting - with much the same analysis regarding the size of N.
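A small illustration of the curve-fitting view, assuming (purely for the sketch) that the model form is known to be linear and the readings are noise-free: two data points then recover the whole curve, i.e. strong knowledge under elements 1 and 2 stands in for a large N.

```python
def fit_line(points):
    # Ordinary least squares for y = a + b*x, from the closed-form normal
    # equations (no libraries needed).
    n = len(points)
    sx = sum(x for x, _ in points)
    sy = sum(y for _, y in points)
    sxx = sum(x * x for x, _ in points)
    sxy = sum(x * y for x, y in points)
    b = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    a = (sy - b * sx) / n
    return a, b

# Noise-free data from a known linear law y = 1 + 2x: N = 2 suffices.
print(fit_line([(0, 1.0), (10, 21.0)]))  # -> (1.0, 2.0)
```

With noisy readings or an unknown model form, the same fit would need many more points - which is exactly the balance between elements 1-3 and element 4.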
My two penneth....
Warm regards,
Terry
==
Dr Terence Love
FDRS, AMIMechE, PMACM, MISI, MAISA
Director
Design Out Crime & CPTED Centre
Perth, Western Australia
[log in to unmask]
www.designoutcrime.org
+61 (0)4 3497 5848
==
ORCID 0000-0002-2436-7566
-----Original Message-----
From: [log in to unmask] [mailto:[log in to unmask]] On Behalf Of Ali Ilhan
Sent: Saturday, 18 February 2017 4:28 AM
To: PhD-Design - This list is for discussion of PhD studies and related research in Design <[log in to unmask]>
Subject: Re: Epistemological Differences -- Research in an Academic Discipline vs. Research in a Professional Practice
Dear Terry,
If I am understanding what you wrote correctly, I have some points of disagreement. You wrote:
-snip-
*“To recap, seen over-simplistically in terms of the research methods of large-sample size, associative, single-question analysis, the use of small samples in design research may appear to be faulty.*
*However, seen in terms of methods of analysis that make use of the rich complex body of information available in each datum and between data, small sample analysis methods can be both more reliable and offer better information.”*
-snip-
I don’t think that you can say that small sample methods can be inherently more reliable and offer better information. It all depends on the research question, and all methods have their epistemological and technical problems. If your goal is to capture variance in a human population, regardless of the circumstances, you need a large enough N (whether the analysis is associative or single-question is entirely a different matter).
You are writing as if modern statistics is a monolithic entity. It is not.
There are methods better suited to explaining variation within units, versus methods that focus on population-averaged effects, versus methods that focus on auto-correlation between longitudinal measurements, and so on. There are methods that rely heavily on hypothesis testing, and on the other hand there are methods that give primacy to what “the data has to say” without any a priori assumptions. Some methods rely on a priori probability distributions while others do not. Some methods are more suitable for single-question set-ups while others can tackle more complex designs. Each has its strengths and weaknesses, and each has its own use.
Going back to the example you gave about exam marks, Rasch Analysis is only appropriate under certain circumstances. If you are interested, for example, in differences in test scores between classrooms or between school districts, or if you want to decompose how much of the difference in test scores stems from within-school variance vs. between-school variance, Rasch Analysis will not do the trick; you will need large samples.
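For what it's worth, the within- vs between-school decomposition can be sketched with made-up scores (the school means and spreads below are illustrative assumptions, not real data):

```python
import random

random.seed(2)

# Illustrative: 200 students in each of three schools with different means.
school_means = [60, 70, 80]
scores = {s: [random.gauss(m, 5) for _ in range(200)]
          for s, m in enumerate(school_means)}

def variance(xs):
    mu = sum(xs) / len(xs)
    return sum((x - mu) ** 2 for x in xs) / len(xs)

all_scores = [x for xs in scores.values() for x in xs]
grand_mean = sum(all_scores) / len(all_scores)

# Law of total variance: total = within-group + between-group components.
within = sum(variance(xs) * len(xs) for xs in scores.values()) / len(all_scores)
between = sum(len(xs) * ((sum(xs) / len(xs)) - grand_mean) ** 2
              for xs in scores.values()) / len(all_scores)

print(f"total={variance(all_scores):.2f} "
      f"within={within:.2f} between={between:.2f}")
```

Estimating the between-school component reliably requires many schools and many students per school - the decomposition itself is trivial, but its inputs demand a large N at both levels.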
About usability testing and sub-atomic particles: small Ns are probably good for identifying problems in a product (software, a system, you name it), precisely because you are trying to elicit information about a single product, and typically between-unit variation in non-living things is extremely low (copies of your software will be more or less identical until you install them on different computers). But if you wanted to learn more about your users, not your products, again small Ns may not help. Once again, there is too much between- and within-unit variation. That is why you cannot use the models you use for sub-atomic particles on humans. We have (at least for some types of particles) reliable theories that help us predict the behavior of certain particles in a probabilistic manner. Yet we are unable to predict where the next revolution or economic crisis will happen, or even probabilistically when my next nervous breakdown will be.
Humans are inherently more complex than David’s research participants, and there is no math that I know of that can explain or predict their behavior :-)
You also wrote:
*--snip--*
*"There is increasing criticism of statistical analysis of big data as resulting in false findings. I've an excellent paper on this but can't put my hand on it at the moment."*
*--snip--*
Big data is typically not analyzed with traditional statistical methods. Machine learning and data mining involve statistics, and there are substantial overlaps, but they are very different approaches. The latter two focus on the data and emergent patterns, while traditional statistics approaches data to answer pre-set questions that stem from theory. That said, you can find similar criticisms of any method, regardless of the N it relies on. Big data can be analyzed in good or bad ways, like any other data.
I don’t think there is an area of research that is devoid of false findings. Some false findings, however, are harder to catch. For most qualitative research, it would take immense time and effort to try to replicate the results, so it is even harder to find the problems in such approaches (I am not implying that quant. research is better).
To summarize, I am not trying to say that big Ns are inherently better than small Ns. Rather, each approach has its own niche, depending on the questions we are asking.
Warm wishes,
Ali
PS. I wrote this in haste, so please excuse any typos and other grammatical issues.
-----------------------------------------------------------------
PhD-Design mailing list <[log in to unmask]>
Discussion of PhD studies and related research in Design
Subscribe or Unsubscribe at https://www.jiscmail.ac.uk/phd-design
-----------------------------------------------------------------