JiscMail Logo
Email discussion lists for the UK Education and Research communities

Help for ALLSTAT Archives


ALLSTAT Archives

ALLSTAT Archives


allstat@jiscmail.ac.uk


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

ALLSTAT Home

ALLSTAT Home

ALLSTAT  March 2007

ALLSTAT March 2007

Options

Subscribe or Unsubscribe

Subscribe or Unsubscribe

Log In

Log In

Get Password

Get Password

Subject:

RSS General Applications Section Meeting on Data Mining

From:

"Marriott, John" <[log in to unmask]>

Reply-To:

Marriott, John

Date:

Tue, 20 Mar 2007 08:49:46 -0000

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (98 lines)

Royal Statistical Society
General Applications Section

Meeting on Data Mining, 11 April 2007 at the RSS, 12 Errol Street,
London EC1Y 8LX

14.00 Professor John Keane, University of Manchester

Developing data mining algorithms

14.45 Professor Mark Girolami, University of Glasgow

Bayesian activity profiling in scoring fraudulent telephone usage

15.30 Tea

16.00 Professor David Hand (Imperial College)

What you get is what you want? Some dangers of black box data mining

16.45 Panel discussion

17.30 Close of meeting

This meeting is sponsored by the Southampton Statistical Sciences
Research Institute.

For further details, please contact the meeting organisers: Alan Kimber
([log in to unmask]) or John Marriott
([log in to unmask])


Abstracts

John Keane, University of Manchester

Developing data mining algorithms

This talk will consider involvement in data mining over more than a
decade, across a wide variety of applications including statistical
disclosure, bioinformatics, medical diagnosis, pedestrian detection and
electricity forecasting. The technical focus will be on a number of
recent activities across a range of algorithmic approaches:
classification, rough sets, item set mining, neuro-fuzzy systems,
hierarchical fuzzy systems and hierarchical hybrid systems. 

Mark Girolami, University of Glasgow

Bayesian activity profiling in scoring fraudulent telephone usage

This talk considers a Bayesian solution to the problem of scoring &
identifying anomalous, and therefore potentially fraudulent, usage of a
telephone service by individual accounts. Creating and maintaining
account specific profiles, which are represented by discrete
multivariate probability distributions, provides a consistent and highly
practical means of scoring and ranking individual service usage in terms
of potential malfeasance. The computational overhead of the proposed
method is such that millions of account profiles can be stored and
maintained so that tens of millions of transactions can be scored on a
daily basis using standard computing capabilities. A commercial
prototype system based on this proposed methodology has been developed
by Memex Technologies and evaluated in an operational environment by the
telecom operator NTL. 

Such model-based profiling methods rely on a compact generative
representation of the sequential activity of a number of individuals
within a population in which case there is a tradeoff between the
definition of individual specific and global models. A linear-time
distributed model for finite state symbolic sequences representing
traces of individual user activity is considered by making the
assumption that heterogeneous user behaviour may be 'explained' by a
relatively small number of common structurally simple behavioural
patterns which may interleave randomly in a user-specific proportion.
The results of empirical studies related to telephone usage and web
browsing behaviour will be presented and indicates that this modelling
approach provides an efficient representation scheme, reflected by
improved prediction performance as well as providing low complexity and
intuitively interpretable representations.

David Hand, Imperial College

What you get is what you want? Some dangers of black box data mining
 
Descriptions of data mining tools, and examples of them in action,
generally assume that the truth is out there, is fixed and invariant,
and can be discovered, at least in principle. The fact is, however, that
life is often more complicated than this suggests.  Sometimes shaky
assumptions cast serious doubt on one's conclusions.  Sometimes the
exciting knowledge nuggets are irrelevant in the context of the bigger
picture. Even worse, however, sometimes the very data mining exercise
itself has consequences which render its own results invalid.  Data
mining is a powerful technology for profit and progress, but careless
use of any powerful technology can have serious adverse consequences. 



This email is intended solely for the addressee.  It may contain private and confidential information.  If you are not the intended addressee, please take no action based on it nor show a copy to anyone.  In this case, please reply to this email to highlight the error.  Opinions and information in this email that do not relate to the official business of Nottingham Trent University shall be understood as neither given nor endorsed by the University.
Nottingham Trent University has taken steps to ensure that this email and any attachments are virus-free, but we do advise that the recipient should check that the email and its attachments are actually virus free.  This is in keeping with good computing practice.

Top of Message | Previous Page | Permalink

JISCMail Tools


RSS Feeds and Sharing


Advanced Options


Archives

April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
January 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
February 2007
January 2007
2006
2005
2004
2003
2002
2001
2000
1999
1998


WWW.JISCMAIL.AC.UK

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager