JiscMail Logo
Email discussion lists for the UK Education and Research communities

Help for CCP4BB Archives


CCP4BB Archives

CCP4BB Archives


CCP4BB@JISCMAIL.AC.UK


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

CCP4BB Home

CCP4BB Home

CCP4BB  July 2013

CCP4BB July 2013

Options

Subscribe or Unsubscribe

Subscribe or Unsubscribe

Log In

Log In

Get Password

Get Password

Subject:

Re: ctruncate bug?

From:

Douglas Theobald <[log in to unmask]>

Reply-To:

Douglas Theobald <[log in to unmask]>

Date:

Thu, 25 Jul 2013 12:07:54 -0400

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (286 lines)

On Jul 13, 2013, at 5:36 PM, Ian Tickle <[log in to unmask]> wrote:

> On 8 July 2013 18:29, Douglas Theobald <[log in to unmask]> wrote:
> 
> > > Photons only have a Poisson distribution when you can count them:
> > > QM says it meaningless to talk about something you can't observe.  
> >
> > Aw, come on --- QM is a theory, it says no such thing.  The claim
> > that "it's meaningless to talk about something you can't observe" is
> > a philosophical principle, not science.  There are many
> > interpretations of QM, some involving hidden variables, which are
> > precisely things that exist that you can't observe.  Heck, I'd say
> > all of science is *exactly* about the existence of things that we
> > only infer and cannot observe directly.  Remember, when you get the
> > readout from a detector, you are not directly observing photons even
> > then --- you are formally inferring things that you can't observe.
> > There's a whole chain of theory and interpretation that gets you
> > from the electronic readout to the claim that the readout actually
> > represents some number of photons.
> 
> In science we make a clear separation of what is 'data' from what is
> 'model' (or 'inference').  

Often we do, but it is not all that clear cut.  Especially in Bayesian
analyses, "data" and inference are often interchangeable. One man's
inference is another man's data.  As a very pertinent example, the true
intensities are inference, but in terms of the Wilson distribution they
are data (p(J|sdw)).  

> Of course one can be pedantic and argue that everything is really
> inference since our brains interpret everything that goes on 'out
> there' by means of inference from its sensory inputs. Obviously I
> don't take seriously the premise of the 'Matrix' movies (excellent
> though they may be!) that such inputs are just a simulation!  

You can accept that everything is inference without slipping down the
slope to the invalid conclusion that the Matrix is true.  

> At some point you have to believe what your eyes are telling you, as
> long as there is a clear chain of believable cause-and-effect between
> the observation and the inference of that observation.  However, we
> are rightly suspicious of any model that is not supported by data (in
> fact inference in science requires data). Note that I always mean
> 'observation' to be synonymous with 'measurement' (as in 'Fobs'), not
> 'observation' in the weaker sense of 'seeing'.

So you agree that all scientific measurements/observations, aside from
trivially "seeing" something, are indirect.  My argument is that you can
in fact measure (or estimate) the background under the intensity using
the model:

Is = Ib' + Ij

Ib

where we experimentally measure Is (the spot) and Ib (the background
around the spot), and we assume that both Ib' and Ib come from the same
(Poisson) distribution with mean B, and that Ij (a sample from the true
spot intensity J) comes from a Poisson as well.

Given that model, you can actually measure Ib' and Ij.  It's an indirect
measurement, but all measurements/observations are.  Here there's really 
no conceptual difference from how we measure photons with a detector:
we assume some model for how photons interact with our detector and we
behave as if we are measuring photons with the output from the detector
(in your words, the model is a "clear chain of believable
cause-and-effect"). All (non-seeing) measurements are ultimately
indirect and model-based, a form of inference.  Your claim that we can't
measure Ib' or Ij is based on the your rejection of the above model ---
but it is a circular argument to reject that model by saying we can't
measure Ib' or Ij.

Bringing in QM is of no help, since QM is consistent with (non-local)
hidden variable interpretations where photons are in fact
distinguishable and exist as real things with definite physical
properties before we measure them.  

> > Again, this is your own personal philosophical interpretation of QM
> > --- QM itself says nothing of the sort.  For instance, Bohm's pilot
> > wave interpretation of QM, which is completely consistent with
> > observation and QM theory and calculation, states that individual
> > photons *do* go through one slit or the other.  But this is really
> > off point here, I think --- as I said, I don't want to get into a QM
> > debate.
>
> Neither do I, I would just observe that under no QM interpretation can
> you determine which slit any individual photon went through and I
> would argue that it's therefore not even a meaningful question to ask
> which slit it went through.

Again, that is one of many possible QM philosophies.  I think, however,
that if we are to ever find a better theory than QM, then we must ask
questions like "which slit did the photon go through".  Otherwise we
have a science-stopper. Just because QM, in its present form, can't tell
us which slit a photon went through does not logically imply that the
photon did not go through a slit.  

> > I disagree.  Following that logic, we could not talk about the error
> > in our estimate of the gravitational force on the Earth from the
> > Moon (because, if our theories of gravity are correct, the force
> > exerted on the Earth is the sum of the gravitational pull of all
> > massive objects in the universe, and it is physically impossible for
> > us to, say, remove the Sun and then independently measure the force
> > exerted by the Moon).
> 
> The estimate of the gravitational force on the Earth from the Moon is
> an inference, not an observation: it relies on Newton's theory of
> gravitation being correct. Of course I'm not denying that you can talk
> about the error estimate for an inference, but I was talking about
> errors in observations (= measurements).  

As I explained above, *all* observations/measurements rely on some model
or theory being correct, and all observations/measurements are indirect
in this sense.  There really is no practical difference between
inference and observation.  All non-trivial observations/measurements
rely on inference.  

> It's physically impossible to move the Sun, but not logically
> impossible (i.e. we can imagine it happening).  In contrast separating
> Ispot into its component Itrue & Iback is a logical impossibility; we
> can't even imagine it without allowing a logical contradiction: since
> photons are indistinguishable the same photons in Ispot have to be
> blocked and not blocked at the same time.  

There is no logical contradiction in imagining that photons are
distinguishable --- the idea that photons are (irreducibly)
indistinguishable is based on a particular interpretation of one
particular physical theory, not logic. If history is any guide, all of
our present theories are wrong.  There very well may be a better theory
than QM in which we can label individual photons.  How is that illogical?

In any case, Bohm's interpretation of QM is observationally equivalent
with QM theory, and in Bohm's interpretation, photons have identity
(i.e., they are identical in the classical sense, but not irreducibly
indistinguishable).  Since Bohm's interpretation is physically possible,
then it is also logically possible.  

As an aside, I note that physicists (and the rest of us) don't really
act like photons are indistinguishable.  We all talk very comfortably
about the photons hitting my eyes from my LCD here in Waltham and those
other photons hitting the ADSC detector at the SIBYLS beamline in
Berkeley.  

> I contend that if a 'measurement' is a logical impossibility (but not
> necessarily a physical one) then it's meaningless to talk about an
> error distribution for such a 'measurement'.
> 
> > My point is, however, that F&W implicitly assumes that the
> > background we measure (in Ib) comes from the same process as the
> > background under the spot (Ib').  In other words, the underlying
> > model is:
> >
> > Is = Ib' + Ij
> >
> > Ib
> >
> > where we experimentally measure Is (the spot) and Ib (the background
> > around the spot), and we assume that both Ib' and Ib come from the
> > same (Poisson) distribution, and that Ij (a sample from the true
> > spot intensity J) comes from a Poisson as well.

> >
> > I agree that Is-Ib is approximately Gaussian for reasonably large Is
> > and Ib. The problem is P(Is-Ib | J,sdJ).  Why is the mean of this
> > Gaussian J?  Why not J^2, or J/4?  How do you derive the Gaussian
> > P(Is-Ib |J,sdJ)?  The claim that the mean of Is-Ib is J requires
> > some sort of justification --- otherwise I might as well claim that
> > the mean of Is-Ib is the number of quarters in my pocket.
> > 
> > We both agreed that Is and Ib are dependent, and so that necessarily
> > means that P(Is-Ib | J,sdJ) cannot be the whole story.  Dependence
> > means that the distribution of Is-Ib will also be a function of B
> > and sdB, unless you've integrated them out somehow (and even then
> > the form of the marginalized distribution depends on B and sdB).
> > 
> > F&W assumes that P(Ij|J,sdJ) is Poisson.  If you think F&W doesn't,
> > then explain how you get the Gaussian P(Is-Ib | J,sdJ).  Again, why
> > is the mean of Is-Ib equal to J?  I gave one derivation, based on
> > the model that P(Ij|J), P(Ib|B), and P(Ib'|B) are all Poisson.  If
> > you disagree, then what is your derivation of the Gaussian P(Is-Ib |
> > J,sdJ)?
> 
> OK let's go through it step-by-step and you tell me where I went
> wrong. Note that I do not assume a prior P(Ij|J,sdJ), in particular I
> do not assume that it's Poisson, or even that Ij or P(Ij|J,sdJ) exist:
> I contend that Ij is certainly not observable.
> 
> 1. The true values of the Bragg & background components of a spot are
> J and B, so the true total coherent scattering is S = J + B.
> 
> [Note: true values are unknown constants, which obviously do not have
> uncertainties, i.e. if we were doing a simulation we might assume say
> J = 10, B = 20; there's no assumption of P(Ij) here.  Of course
> estimates of true values do have uncertainties, but the Wilson prior
> P(J), the posterior P(J|Is,Ib) and hence <J> and var(<J>) only come in
> later after we invoke Bayes' theorem in F & W.]

So I find this a bit odd, as it seems to me we are saying the same
thing, but you evidently see a difference.  You accept that S=J+B is
a valid model, which is very similar to the model I've been proposing
(in fact to me it appears identical, except that you deny a distribution
for Ij).  But you also claim that we can never measure J or B (where B is
by definition the background under the true intensity that combines to
form S). Yet you say that "it's meaningless to talk about something you
can't observe".  What gives?  This seems contradictory to me --- where
am I misunderstanding you?

A few words about Bayesian vs frequentist interpretations of
probability.  From a Bayesian POV, true values are in general not
constants and they have uncertainties.  (A constant is simply a random
variable that has such small uncertainty that its distribution is a
delta function with mean equal to the true value.) The Bayesian view of
probability holds that uncertainties are only in our minds;
uncertainties (or probabilities) are not properties of physical objects.
Thus, from a Bayesian POV, it is completely natural (and necessary) to
assign a probability distribution to any unknown value, like B and J. In
frequentism it might not make any sense, but here we are discussing a
Bayesian method (F&W), not a frequentist one.  Perhaps this is actually
the root of our disagreement. 

> 2. We make a measurement of S and get Is.  If we made multiple
> measurements we would expect to see (say) a Poisson distribution, in
> which case for a single measurement we can estimate var(S) = S ~ Is.

OK.

> 3. We also estimate B from the adjacent background Ib and similarly we
> estimate var(B) = B ~ Ib.

OK.

> 4. So now we have estimates of S (~ Is) and B (~ Ib), so therefore we
> have an estimate of J = S - B (~ Is - Ib).

Now, you agree that Is is Poisson with mean S, and that we can measure
Is.  You also agree that Ib is Poisson with mean B, and that we can
measure Ib.  You also assume that S = J + B.  Yet, it is a well-known
statistical fact that a Poisson with mean S=J+B is equivalent to the sum
of two Poissons, with means J and B respectively.  You can easily prove
this using moment generating functions.  So your model here and my model
are mathematically equivalent.  The existence of P(Ij|J) and Ij is 
mathematically implied by your assumptions, and Ij is necessarily
Poisson.  

> 5. We have established that the difference of 2 Poissons with means S
> and B is well-approximated by a Gaussian with mean (S-B) = J and
> variance (S+B).

Not so fast.  The difference of two Poissons is a Skellam distribution.
If the means of the two Poissons are S and B, then the Skellam has two
parameters S and B.  When S and B are equal, the Skellam distribution
very quickly approaches a Gaussian as S and B get larger.  This is
basically equivalent to the result that Randy got with his Mathematica
plots. However, when S and B are unequal, the Skellam is always skewed
(if S>B, right skewed, if S<B, left skewed).  If S and B are very
different (say one is 5 times the other), then the approach to a
Gaussian is much slower.  So now that I've talked this out, I think 
I've pinpointed where the Gaussian approximation comes in with F&W 
--- it's not with the Poissons, as I thought, but with the Skellam.  

And now I've gone back and reread F&W 1978 more carefully; they 
apparently make this very argument and cite Skellam:

"Firstly, whilst we accept the data are certainly not exactly normally
distributed, we do contend that the normal distribution is an adequate
approximation for our purposes. From a theoretical point of view we are
encouraged in this belief, since the merged intensity I is made up of
sums of differences of theoretically Poisson-distributed counts. Such
operations on Poisson variables reduce them to normality quickly (Irwin,
1937; Skellam, 1946)." p. 518.

(Note that what they call the "merged intensity I" is what we are
calling Is-Ib.)

> 6. So P(Is-Ib,J) is Gaussian with mean J, and var(J) = (S+B) ~ (Is+Ib).
> 
> I don't see what's wrong with that.

So I don't really see anything wrong with that, except that you missed
the fact that your derivation is mathematically identical to mine, since
your assumptions necessarily imply Poisson P(Ij|J).  

Cheers,

Douglas

Top of Message | Previous Page | Permalink

JiscMail Tools


RSS Feeds and Sharing


Advanced Options


Archives

April 2024
March 2024
February 2024
January 2024
December 2023
November 2023
October 2023
September 2023
August 2023
July 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
December 2022
November 2022
October 2022
September 2022
August 2022
July 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
December 2021
November 2021
October 2021
September 2021
August 2021
July 2021
June 2021
May 2021
April 2021
March 2021
February 2021
January 2021
December 2020
November 2020
October 2020
September 2020
August 2020
July 2020
June 2020
May 2020
April 2020
March 2020
February 2020
January 2020
December 2019
November 2019
October 2019
September 2019
August 2019
July 2019
June 2019
May 2019
April 2019
March 2019
February 2019
January 2019
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
October 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
February 2017
January 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
February 2007
January 2007


JiscMail is a Jisc service.

View our service policies at https://www.jiscmail.ac.uk/policyandsecurity/ and Jisc's privacy policy at https://www.jisc.ac.uk/website/privacy-notice

For help and support help@jisc.ac.uk

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager