talk: Morik on Data Analytics for Sustainability, 11am Thr 5/22, ITE456

wikipedia

Computer Science and Electrical Engineering
University of Maryland, Baltimore County

Data Analytics for Sustainability

Professor Katharina Morik
TU Dortmund University, Germany

11:00am-12:30pm, Thursday 22 May 2014, ITE 456, UMBC

Sustainability has many facets and researchers from many disciplines are working on them. Particularly knowledge discovery always considered sustainability an important topic (e.g., special issue on data mining for sustainability in Data Mining and Knowledge Discovery Journal, March 2012).

  • Environmental tasks include risk analysis concerning floods, earthquakes, fires, and other disasters as well as the ability to react to them in order to guarantee resilience. The climate is certainly of influence and the debate on climate change received quite some attention.
  • Energy efficiency demands energy-aware algorithms, operating systems, green computing. System operations are to be adapted to a predicted user behavior such that the required processing is optimized with respect to minimal energy consumption.
  • Engineering tasks in manufacturing, assembly, material processing, and waste removal or recycling offer opportunities to save resources to a large degree. Adding the prediction precision of learning algorithms to the general knowledge of the engineers allows for surprisingly large savings.

Global reports on the millennium goals and open government data regarding sustainability are publicly available. For the investigation of influence factors, however, data analytics is necessary. Big data challenges the analysis to create data summaries. Moreover, the prediction of states is necessary in order to plan accordingly. In this talk, two case studies will be presented. Disaster management in case of a flood combines diverse sensor data streams for a better traffic administration. A novel spatiotemporal random field approach is used for smart routing based on traffic predictions. The other case study is in engineering and saves energy in the steel production based on the multivariate prediction of the processing end-point by the regression support vector machine.

Further reading:

  • Katharina Morik, Kanishka Bhaduri, Hillol Kargupta “Introduction to Data Mining for Sustainability”, Data Mining and Knowledge Discovery Journal, Vol. 24, No.2, pp. 311 – 324, 2012.
  • Nico Piatkowski, Sangkyun Lee, Katharina Morik “Spatio-Temporal Random Fields: Compressible Representation and Distributed Estimation”, Machine Learning Journal Vol.93, No. 1, pp: 115-139, 2013.
  • Jochen Streicher, Nico Piatkowski, Katharina Morik, Olaf Spinczyk “Open Smartphone Data for Mobility and Utilization Analysis in Ubiquitous Environments” In: Mining Ubiquitous and Social Environments (MUSE) workshop at ECML PKDD, 2013.
  • Norbert Uebbe, Hans Jürgen Odenthal, Jochen Schlüter, Hendrik Blom, Katharina MorikA novel data-driven prediction model for BOF endpoint. In: The Iron and Steel Technology Conference and Exposition in Pittsburgh (AIST), 2013.
  • Alexander Artikis, Matthias Weidlich, Francois Schnitzler, Ioannis Boutsis, Thomas Liebig, Nico Piatkowski, Christian Bockermann, Katharina Morik, Vana Kalogeraki, Avigdor Gal, Shie Mannor, Dimitrios Gunopulos, Dermot Kinane, “Heterogeneous Stream Processing and Crowdsourcing for Urban Traffic Management” Procs. 17th International Conference on Extending Database Technology, 2014.

Katharina Morik is full professor for computer science at the TU Dortmund University, Germany. She earned her Ph.D. (1981) at the University of Hamburg and her habilitation (1988) at the TU Berlin. Starting with natural language processing, her interest moved to machine learning ranging from inductive logic programming to statistical learning, then to the analysis of very large data collections, high-dimensional data, and resource awareness.

Her aim to share scientific results strongly supports open source developments. For instance, RapidMiner started out at her lab, which continues to contribute to it. She was one of those starting the IEEE International Conference on Data Mining together with Xindong Wu, and was chairing the program of this conference in 2004. She was the program chair of the European Conference on Machine Learning (ECML) in 1989 and one of the program chairs of ECML PKDD 2008. She is in the editorial boards of the international journals “Knowledge and Information Systems” and “Data Mining and Knowledge Discovery”. Since 2011 she is leading the collaborative research center SFB876 on resource-constrained data analysis, an interdisciplinary center comprising 12 projects, 19 professors, and about 50 Ph. D students or Postdocs.

Host: Hillol Kargupta,

PhD proposal: Das on Privacy & Security Management on Mobile Devices, 8am Fri 5/16

das

PhD Dissertation Proposal

Learning and Executing Energy Efficient, Context-Dependent
Rules for Privacy and Security Management on Mobile Devices

Prajit Kumar Das

8:00am Friday, 16 May 2014, ITE325b

There are ongoing security and privacy concerns around mobile platforms that are increasingly being used by citizens. For example a newly discovered security flaw in WhatsApp that allows hackers using a malicious app to read chat messages stored on the SD card. The Brightest Flashlight application was reported to have logged precise location and a unique user identifier, which have nothing to with its intended functionality. Current mobile platform privacy and security mechanisms are limited to an initial installation phase permission acquisition method. In addition to that, the permissions are of the all or none form. This means that either the users accept all the permissions requested by the mobile app or they cannot use the app in question. Even if permissions were not structured as such, typically, users do not understand the permissions being requested or are too eager to use the application to even care to read them. These issues are present in all major mobile operating systems. Given the penetration of mobile devices into our lives, a fine-grained context-dependent security and privacy control approach needs to be created.

We propose a framework that will allow us to learn the privacy and security rules for a particular user, on their mobile devices. We do this by employing a simple user feedback mechanism. The rule learning framework consists of a “learning mode” where it observes and learns from user behavior and a “working mode” where it implements the learned rules to protect user privacy and provide security. The rules are represented to the user in plain English using an easily understandable construct. The rules are internally written in a logic based language and using Semantic Web technologies. The antecedents of the rules are context elements that are derived from an ontology using a query engine and an inference mechanism. The main contributions of our work include learning modifications to current rules and learning new rules to control the data flow between the various data providers on the user’s mobile device, including sensors and services and the consumer of such data. The privacy and security rule execution consumes significant energy due to the context detection. We create an energy model that allows us to make energy cost optimizations with regards to rule execution. We use a three-fold solution for achieving the said energy cost optimizations.

Committee: Drs. Anupam Joshi (chair), Nilanjan Banerjee, Dipanjan Chakraborty (IBM), Tim Finin, Tim Oates, Arkady Zaslavsky (CSIRO)

PhD proposal: Yatish Joshi on connectivity restoration in wireless sensor networks

PhD Proposal

Distributed protocols for connectivity restoration
in damaged wireless sensor networks

Yatish K. Joshi

1:00pm Monday, 12 May 2014, ITE325b, UMBC

Decreasing costs and increasing functionality of embedded computation and communication devices have made Wireless Sensor Networks (WSNs) attractive for applications that serve in inhospitable environments like battlefields, planetary exploration or environmental monitoring. WSNs employed in these environments are expected to work autonomously and extend network lifespan for as long as possible while carrying out their designated tasks. The harsh environment exposes the individual nodes to q high risk of failure, which can potentially partition the network into disjoint segments. Therefore, a network must be able to self-heal and restore lost connectivity using available resources. The ad-hoc nature of deployment, harsh operating environment and lack of resources makes distributed approaches the most suitable choice for recovery.

Most solution strategies for tolerating the failure of multiple collocated nodes are based on centralized approaches that pursue the placement of additional relays to form a connected inter-segment topology. While they are the ideal solution for dealing with simultaneous multi-node failures, they need to utilize the entire network state to determine where and how recovery should occur. In addition to the scalability concern of these approaches, controlled placement of stationary relays in remote and inhospitable deployment area may not be logistically feasible due to resource unavailability and would not be responsive due to the delay in transporting the resources to the area. Space exploration is an example of those WSN applications in which placement of stationary relays is not practical.

In this proposal, we tackle the problem of connectivity restoration in a partitioned WSN in a distributed manner. We consider multiple variants of the problem based on the available resources and present novel recovery schemes that suit the capabilities and count of existing nodes.

Committee: Drs. Mohamed Younis (Chair), Dr. Charles Nicholas, Dr. Chintan Patel, Dr. Kemal Akkaya (SIU-Carbondale)Dr. Waleed Youssef (IBM)

MS defense: Bansal on Recoloring Web Pages for CVD

MS Thesis Defense

Recoloring Web Pages For Color Vision Deficiency Users

Vikas Bansal

11:00am Thursday, May 8, 2014, ITE346, UMBC

Color vision begins with the activation cone cells. When one of the cone cells dysfunction, color vision deficiency (CVD) ensues. Due to CVD, users become unable to differentiate as many colors a normal person can. Lack of this ability results in less rich web experience, incomprehension of basic information and thus frustration. Solutions such as carefully choosing colors while designing or recolor web pages for CVD users exist. We first present the improvement in the time complexity of an existing tool SPRWeb to recolor web pages. After that we present our tool which explores the foreground-background relationship between colors in a web page. Using this relationship we propose an algorithm which preserves naturalness, pair-differentiability and subjectivity. In the last part, we add an additional step in to algorithm to ensure that the contrast in the parsed color pairs meets the required W3C guidelines. In evaluation, we found that our algorithm does significantly better in preserving pair-differentiability and produces lower total cost solutions than SPRWeb. Quantitative experimentation of modified algorithm shows that contrast ratio in each replacement pair is more than 4.5 as required for readability.

Committee: Drs. Lina Zhou (co-chair), Tim Finin (ch-chair), Yelena Yesha, Dongsong Zhang

talk: Ron Ross (NIST) on Cybersecurity, 6pm Wed 4/30

dhs_govtpanel

UMBC Information Systems Security Association Seminar

Framework for Improving Critical
Infrastructure in Cybersecurity

Dr. Ron Ross, NIST

6:00-8:00pm Wednesday, 30 April 20014
Meyerhoff 030 Building Lecture Hall 2

RSVP
Schedule:
6:00-6:30pm Introductions to UMBC ISSA, Networking & Pizza
6:30-7:30pm Cyber Security Lecture From Dr. Ron Ross
7:30-8:00pm Networking

Host: Monique Jeffrey, UMBC ISSA President,

Ron Ross is a Fellow at the National Institute of Standards and Technology (NIST). His current areas of specialization include information security and risk management. Dr. Ross leads the Federal Information Security Management Act (FISMA) Implementation Project, which includes the development of security standards and guidelines for the federal government, contractors, and the United States critical information infrastructure.

A graduate of the United States Military Academy at West Point, Dr. Ross served in a variety of leadership and technical positions during his over twenty-year career in the United States Army. While assigned to the National Security Agency, he received the Scientific Achievement Award for his work on an inter-agency national security project and was awarded the Defense Superior Service Medal upon his departure from the agency. Dr. Ross is a three-time recipient of the Federal 100 award for his leadership and technical contributions to critical information security projects affecting the federal government and is a recipient of the Department of Commerce Gold and Silver Medal Awards.

Dr. Ross has been inducted into the Information Systems Security Association (ISSA) Hall of Fame and given its highest honor of ISSA Distinguished Fellow. Dr. Ross has also received several private sector cyber security awards and recognition including the Vanguard ChairmanÕs Award, the Symantec Cyber 7 Award, InformationWeek’s Government CIO 50 Award, Best of GTRA Award, and the ISACA National Capital Area Conyers Award. During his military career, Dr. Ross served as a White House aide and as a senior technical advisor to the Department of the Army. Dr. Ross is a graduate of the Defense Systems Management College and holds Masters and Ph.D. degrees in Computer Science from the U.S. Naval Postgraduate School specializing in artificial intelligence and robotics.

talk: White House Climate Data Initiative, 3pm Tue 4/29

wikipedia

Center for Hybrid Multicore Productivity Research
Distinguished Computational Science Lecture Series

The White House Climate Data Initiative

Eric Letvin
Director, Disaster and Failure Studies
National Security Council

3:00pm Tuesday, 29 April 2014, ITE 456, UMBC

Delivering on the commitment in the President’s Climate Action Plan, the White House recently launched the Climate Data Initiative — a broad effort to leverage the Federal Government’s extensive, freely- available climate-relevant data resources to advance awareness of and preparedness for climate change impacts. This effort will help give communities across America the information and tools they need to plan for current and future climate impacts. Data from NOAA, NASA, the U.S. Geological Survey, the Department of Defense, and other Federal agencies was recently launched on climate.data.gov. Data and innovation challenges issued by public, private, nonprofit, and other organizations can help catalyze new, data-driven solutions that help communities understand and build resilience to climate change. NOAA and NASA recently announced an innovation challenge calling on researchers and developers to create data-driven simulations to help plan for the future and to educate the public about the vulnerability of their own communities to sea level rise and flood events.

Mr. Eric Letvin PE, Esq, is the Director of Hazard Mitigation and Risk Reduction Policy within the National Security Council in the Executive Office of the President. He coordinates the development and effective delivery of mitigation capabilities identified in the National Preparedness Goal, such as threat and hazard identification, risk and disaster resilience assessment, planning, and long-term vulnerability reduction.

When at NIST, Mr. Letvin is the Disaster and Failure Studies Program Director within NIST’s Engineering Laboratory. Mr. Letvin provides national coordination for conducting field data collection studies. He is also responsible for creating and maintaining a repository related to hazard events (earthquakes, hurricanes, tornadoes, windstorms, community-scale fires in the wildland-urban interface, structural fires, storm surge, flood, tsunami) and human-made hazards (accidental, criminal, or terrorist), the performance of the built environment during hazard events, associated emergency response and evacuation procedures.

Before coming to NIST, Mr. Letvin was Leader of Infrastructure Research and Resiliency in the Homeland Security Group of URS. He has participated in numerous post-disaster studies including the bombing of the Murrah Building in Oklahoma City, and Hurricanes Opal, Ike and Katrina. He has assessed over 200 buildings for risk from terrorist threats and natural disasters.

Mr. Letvin holds a bachelor’s and master’s degree in environmental engineering from Syracuse University and received his Juris Doctor from the University of Maryland. He has taught many courses on risk assessments and protection of infrastructure for FEMA/DHS and made related presentations throughout the world over the last ten years.

talk: Translational Bioinformatics Approaches to Evaluate and Implement Genomic Medicine Programs, 1pm 4/25

Human genome, wikipedia

Translational Bioinformatics Approaches to Evaluate
and Implement Genomic Medicine Programs

Dr. Casey Overby, Assistant Professor

Program for Personalized and Genomic Medicine
University of Maryland – Baltimore

1:00pm Friday, 25 April 2014, ITE 325b, UMBC

There is a growing evidence base to support the use of many genomic applications in healthcare. There are, however, several barriers to healthcare providers making use of genomic data and information on a routine basis. In this talk, I will describe some of our challenges and successes with implementing genomic medicine programs within the Program for Personalized and Genomic Medicine at UMB, introduce one way to conceptualize translational research and translational bioinformatics in this context, describe a proposed model for evaluating and implementing genomic medicine programs, and describe some of my current and planned research in translational bioinformatics.

Casey L. Overby is an Assistant Professor of Medicine in the Program for Personalized and Genomic Medicine and the Center for Health-related Informatics and Bio-Imaging at the University of Maryland School of Medicine. She received her Masters of Biotechnology from the University of Pennsylvania in 2006, her PhD in Biomedical and Health Informatics and a Graduate Certificate in Public Health Genetics from the University of Washington in 2011. In 2013, she completed her post-doctoral training in the Department of Biomedical Informatics at Columbia University and started her position at University of Maryland, Baltimore.

Host: Marie desJardins,

defense: Learning Hierarchical Workflows Using Community Detection, 4/18

MS Thesis Defense

Learning Hierarchical Workflows Using Community Detection

Akshay Peshave

1:00pm Friday, 18 April 2014, ITE 325b

Workflows identified from user event logs and click-stream data are useful as knowledge bases for behavioral analysis and recommendation systems. In this study we identify abstractions or summaries of event logs modeled as user activity flow networks. The abstractions are identified based on structural properties as well as user activity flow dynamics over the network using community detection methods. We apply a fast modularity optimization and multi-level resolution approach to detect hierarchical community structure in user activity flow networks. The detected communities are compared to those detected by the information-theoretic map equation minimization approach to weigh pros and cons of the fast modularity optimization approach in the workflows context. We further attempt to identify the most probable sources and sinks of user activity in individual communities and trim the network accordingly to reduce entropy of the workflow abstractions.

Committee: Drs. Tim Oates (chair), Matt Schmill and Tim Finin

defense: Rosebrock on Image Classification, 9am 4/18

wikipedia

Computer Science and Electrical Engineering
University of Maryland, Baltimore County
Ph.D. Dissertation Defense

A Rapidly Deployable Image Classification System Using Feature Views

Adrian Rosebrock

9:00am Friday, 18 April 2014, ITE 346, UMBC

Constructing an image classification system using strong, local invariant descriptors is both time consuming and tedious, requiring much experimentation and parameter tunings to obtain an adequate performing model. Furthermore, training a system in a given domain and then migrating the model to a separate domain will likely yield poor performance. As the recent Boston Marathon attacks demonstrated, large, unstructured image databases from traffic cameras, security systems, law enforcement officials, and citizens can be quickly amassed for authorities to review; however, reviewing each and every image is an expensive undertaking, in terms of both time and human effort. Inherently, reviewing crime scene images is a classification task. For example, authorities may want to know if a given image contains a suspect, a suspicious package, or if there are injured people in the photo. Given an emergency situation, these classifications will be needed as quickly and accurately as possible. In this work we present a rapidly deployable image classification system using “feature views”, where each view consists of a set of weak, global features. These weak global descriptors are computationally simple to extract, intuitive to understand, and require substantially less parameter tuning than their local invariant counterparts. We demonstrate that by combining weak features with ensemble methods we are able to outperform current state-of-the-art methods or achieve comparable accuracy with much less effort and domain knowledge. We then provide both theoretical and empirical justifications for our ensemble framework that can be used to construct rapidly deployable image classification systems called “Ecosembles”.

Finally, we recognize the fact that image datasets give us the relatively unique opportunity to extract multiple feature representations through the use of various descriptors. In situations where the original dataset is not available for further feature extraction or in cases where multiple feature views are ambiguous (such as predicting income based on geographical location and census data) the Ecosemble method cannot be applied. In order to extend Ecosembles to arbitrary datasets of diverse modalities, we introduce artificial feature views using kernel approximations. These artificial feature views are constructed from a single representation of the data, alleviating the need to explicitly extract multiple feature views. We then apply artificial feature views to a diverse range of non-image classification datasets to demonstrate our method is applicable to multiple modalities, while still outperforming current state-of-the-art methods.

Committee: Drs. Tim Oates (chair), Jesus Caban, Tim Finin, Charles Nicholas, Jian Chen

talk: A multi-scale approach to analyze large clinical datasets, Noon Thr 4/10

A multi-scale approach to analyze large clinical datasets:
Towards the understanding of the complex effects of concussions

Dr. Jesus Caban
National Intrepid Center of Excellence
Walter Reed, Bethesda, MD

Noon Thursday, 10 April 2014, ITE325b

Mild traumatic brain injuries (mTBIs) or concussions are invisible injuries that are poorly understood and their sequelae can be difficult to diagnose. Individuals who have had concussions are at an increased risk of depression, post-traumatic stress disorder (PTSD), headaches, concentration difficulties, and other problems. During the last decade, a significant amount of attention has been given to the acquisition of clinical data from patients suffering from mTBI. Unfortunately, most of the data collection and analysis have focused on individual aspects of the injury, not necessarily on comprehensive and multi-modal analytical techniques to capture the complex biological state of mTBI patients.

This talk will discuss a large-scale informatics database that has been developed to enable interdisciplinary research on mTBI and will introduce a multi-scale approach to mine complex clinical datasets. The millions of multi-modal elements originated from different clinical disciplines are treated as weak features and modeled independently to generate stronger features. Three cases of going from weak to stronger features will be discussed including (a) an inductive/transductive model to extract stable image features from multi-modal MRI scans, (b) a rule-based model used to infer knowledge from blood measurements, and (c) a sentiment analysis-based model to extract behavioral signals from writing samples. Once stronger features are obtained, a relational model is used to integrate the data and extract new knowledge from such a complex dataset.

Dr. Caban is the Acting Chief of Clinical & Research Informatics at the National Intrepid Center of Excellence (NICoE) at Walter Reed Bethesda. He received a Ph.D. in Computer Science from UMBC (2009), his M.S. degree in Computer Science from the University of Kentucky (2005), and his B.S. in Computer Science from the University of Puerto Rico (2002). Over the last eight years Dr. Caban’s research has focused on the design and development of techniques to analyze clinical and imaging data. His research and experience has given him the opportunity to work at top research and healthcare organizations including the National Institutes of Health (NIH), John Hopkins University, the University of Maryland Medical Center, and IBM Research. Dr. Caban is presently an adjunct faculty member at John Hopkins University Applied Physics Lab and a part-time instructor at the Department of Computer Science at UMBC. Recently, he received the 2013-14 JHU/APL Junior faculty award for his commitment to teaching. Currently, he is serving as the Associate Editor of the JAMIA special issue on Visual Analytics in Healthcare and as the contracting officer representative (COR) for the DoD program on “Watson-Like Technologies for TBI/PTSD Clinical Decision Support and Predictive Analytics”.

1 28 29 30 31 32 58