JISCMail - AI-SGES Archives

[Please accept our apologies if you receive multiple copies of this emails]

===========================================================================
      The First International Workshop on Mining and Understanding 
                     from Big Data (BigMUD 2013)
                (http://kdd.csd.uwo.ca/BigMUD.html)

                         In conjunction with
        IEEE International Conference on Data Mining (ICDM 2013)
                    (http://icdm2013.rutgers.edu/)

                  Dallas, TexasĄ¤December 8-11, 2013

(Distinguished papers presented at the workshop, after further extension and 
 revision, will appear in a Special Issue of the SCI-indexed Journal
   - Journal of Computer Science and Technology (JCST), Springer- 
       http://jcst.ict.ac.cn:8080/jcst/EN/volumn/home.shtml)
============================================================================

Introduction:
-------------

Big data refers to datasets that exceed the competence of commonly used 
IT systems in terms of processing space and/or time. Traditionally, 
massive data are mostly produced in scientific fields such as astronomy, 
meteorology, genomics physics, biology, and environmental research. Due 
to the rapid development of IT technology and the consequent decrease of 
cost on collecting and storing data, big data has been  generated from 
almost every industry and sector as well as governmental department, 
including retail, finance, banking, security, audit, electric power, 
healthcare, to name a few. Recently, big data over the Web (big Web data 
for short), which includes all the context data, such as, user generated 
contents, browser/search log data, deep web data, etc., have attracted 
extensive interests, as these context data and their analyses help us to 
understand what is happening in real life. This can help to enable new 
ways for improving user experience by providing more accurate predictions 
and recommendations thus creating a personalized smarter internet.

Currently, big data is often on the order of petabytes and even exabytes. 
However, big data has become bigger and bigger not only in its size, but 
also in its growth rate and variety. The volume of big data often grows 
exponentially or even in rates that overwhelm the well-known MooreĄŻs Law. 
Meanwhile, big data has been extended from traditional structured data 
into semi-structured and completely unstructured data of different types, 
such as text, image, audio, video, click streams, log files, etc. 
Moreover, big data is often internally interconnected and thus form complex 
data/information networks.

Although big data can offer us unprecedented opportunities, they also pose 
many grand challenges. Due to the massive volume and inherent complexity, 
it is extremely difficult to store, aggregate, manage, and analyze big data 
and finally mine valuable information/knowledge from the complex data/
information networks. Therefore, in the presence of big data, the models, 
algorithms and methods for traditional data mining become no longer 
effective and efficient. For instance, similarity learning, upon which 
various similarity-based tasks (e.g., ranking and clustering) can be launched, 
is extremely challenging for real applications with big data due to their 
typical features such as the data being heterogeneous, time-evolving, sparse 
and noisy. On the other hand, some data is generated exponentially or super-
exponentially in a streaming manner. Therefore, how to carry out real-time 
analysis on, and deep mining and understanding from big data so as to obtain 
dynamical and incremental information/knowledge, is another grand challenge. 
In general, at the era of big data, it is expected to develop new models, 
algorithms, methods, and even paradigms for mining, analyzing, and 
understanding big data.

This workshop aims to provide a networking venue that will bring together 
scientists, researchers, professionals, and practitioners from both industry 
and academia and from different disciplines (including computer science, 
social science, network science, etc.) to exchange ideas, discuss solutions, 
share experiences, promote collaborations, and report state-of-the-art 
research results and technological innovations on various aspects of mining 
and understanding from big data.

Scope and Topics:
-----------------

The topics of interest include, but are not limited to:

- Acquisition, representation, indexing, storage, and management of big data
- Processing, pre-processing, and post-processing of big data
- Models, algorithms, and methods for big data mining and understanding
- Knowledge discovery and semantic-based mining from big data
- Metric/similarity learning for big data
- Visualizing analytics and organization for big data
- Context data mining from big Web data
- Social computing over big Web data (e.g., network analysis, community 
    detection)
- Industrial and scientific applications of big data mining such as search 
    and recommendations

Important Dates:
----------------

Submission Deadline:   August 3, 2013
Authors Notification:  September 24, 2013
Workshop Date:         December 8, 2013


Paper Submission Guideline:
---------------------------

All papers need to be submitted electronically through the conference website 
(https://wi-lab.com/cyberchair/2013/icdm13/scripts/submit.php?subarea=DM) with 
PDF format. The materials presented in the papers should not be published or be 
under submission elsewhere. Each paper is limited to 8 pages including figures 
and references and follows the IEEE ICDM format requirements 
(http://icdm2013.rutgers.edu/author-instructions). 

Once accepted, the paper will be included into the conference proceedings published 
by IEEE Computer Society Press (indexed by EI). At least one of the authors of 
any accepted paper is requested to register the paper at the workshop.

Distinguished papers presented at the workshop, after further extension and 
revision, will appear in a Special Issue of the SCI-indexed Journal, namely,
Journal of Computer Science and Technology (JCST), Springer 
(http://jcst.ict.ac.cn:8080/jcst/EN/volumn/home.shtml).


Workshop Co-Chairs:
--------------------

- Xueqi Cheng, Institute of Computing Technology, CAS, China, [log in to unmask]
- Alvin Chin, Nokia, China, [log in to unmask]
- Charles X. Ling, Western University, Canada, cling@ csd.uwo.ca 
- Fei Wang, IBM T. J. Watson Research Center, USA, [log in to unmask]


Organizing committee:
---------------------

Jilei Tian	Nokia Research Center, China 
Guanling Chen	University of Massachusetts Lowell, USA, 
Enhong Chen	University of Science and Technology of China 
Jun Wang	IBM T.J. Watson Research Center 
Peng Cui	Tsinghua University, China
Irwin King	Chinese University of Hong Kong, China 


Program Committee:
------------------

(To be added)