DAMA + Wilshire Meta-Data Conference - Data Strategy Track
Have you read or seen much on the next evolution of the web called Web 2.0? Web 2.0 is a collection of technologies and frameworks that enable collaboration from a social perspective. We can see this transformation from Web 1.0 to the more collaborative Web 2.0 all around us. Products like Microsoft’s SharePoint, IBM’s WorkSpace, and many others are transforming the corporate landscape for knowledge and information collections. Outside the corporation, the success of Wikipedia, Folksonomies, RSS, and weblogs are undeniably changing every aspect of our lives. The Internet and associated technologies are under constant evolution. Within the world of metadata, we have seen three such transformations. After studying the Web 2.0 environment, I believe the next transformation is clear.

Attendees will learn the following:
  • An overview of Web 2.0 technologies
  • Review one organizations implementation of collaboration technologies
  • Review the next iteration of metadata value for the enterprise

In the gallery of business functions, no picture is more brightly drawn than information management. As executives embrace the idea of data as an asset, more of them are agreeing to invest in the tools, skills, and organizational enablers necessary to deploy it to the enterprise. This is a good news/bad news proposition: Yes, managers are getting it, but expectations are higher than ever. In this keynote session, author and consultant Jill Dyche looks at the 5 biggest trends in data management-courtesy of the early adopter companies that have made big changes to their information infrastructures and reaped the rewards-and discusses how corporate strategies are driving sweeping changes to data gathering, quality, integration, and propagation. Jill will tie in tactical ways for data management professionals to start leveraging their skills, business acumen, experience, and technology resources to claim a bigger piece of the information pie.
  •  Why "architecture" is no longer a dirty word (thanks to SOA)
  •  How new requirements techniques are changing the landscape
  •  Why success means "SCP" (Small, Controlled, Projects)
  •  Why loosely-coupled systems mandate tightly-coupled information skill sets
  •  Why master data management (MDM) is good news for everyone

One of the major challenges DM professionals face is gaining the trust of business sponsors & end users. More often than not, the first time a user sees the data is during user acceptance testing or after deployment. Full scale prototyping is not typically done because it is painful and costly.

A new concept being implemented within DM departments of leading organizations is the “Virtual Prototyping Warehouse.” This approach involves providing a temporary, virtual view into the combined data sources to allow end business users to rapidly define, test, edit, report from and validate the ultimate, physical data repository before it is implemented. By doing this, they gain immediate value from the data sharing initiative as well as a tool for managing changes to requirements.

In this session, Mr. Paat will outline:
  • The benefits of creating a "Virtual Prototyping Warehouse"
  • A detailed implementation process, including how to take advantage of existing investments and proven technologies
  • Real-life examples from a leading financial services company & a public sector agency
  • How this approach can be incorporated into an organization’s overall development lifecycle to reduce risk and uncertainty associated with major data integration efforts, while speeding business value and time-to-ROI.

“Data Governance” is top of mind for many corporate managers as the regulatory environment is becoming more rigorous. Businesses face mounting pressure to discover and protect hidden sensitive data elements, create master data repositories and reduce data inconsistencies across dataset boundaries. The first step for any of these kinds of data governance efforts is very basic: DISCOVERY: You can’t manage what you can’t find!

In the past, companies have attempted to use metadata to answer these questions. The problem is that metadata reflects what people assume is in their database…not the reality of what really exists. Learn how a new methodology called “data-driven mapping” and an accompanying technology eliminates the guesswork with a fact-based approach that efficiently examines millions of rows of data values to automatically discover hidden sensitive data, forgotten business rules and unknown data inconsistencies across disparate datasets. Customer case studies will include:
  • One of the world’s major credit card companies using this approach to quickly and affordably create an enterprise data flow map
  • One of the oldest and largest insurance companies identifying hidden sensitive data for de-identification prior to shipping overseas for outsourced development
  • The nation’s largest mutual fund company uncover forgotten business rules for a corporate master data management database.

The convergence of media and information technology is developing a need for new concepts in information storage. The merged media future will still need the data we have seen for the last fifty years, but in addition there are requirements to store and retrieve new types of data, both structured and unstructured, dramatic increases in the volume of structured and unstructured data, uncontrolled data entry resulting in vast amounts of data of unknown quality and a new ability to mix data sources in a single information access. The new data storage system must understand the information framework that it manages and the context of the source and destination of information.

This presentation will examine the application of concepts of human understanding to electronic data storage systems. Both the theoretical aspects of computer understanding and practical applications of this new technology will be presented along with case studies of some initial efforts in this area.



Companies worldwide are plagued by theft of laptops, intellectual property, and worker and customer identities, resulting in unwanted media attention, compliance violations, unbudgeted financial impact, and irreversible brand and reputation damage. Discovering where a company’s critical documents and customer data reside and protecting them from unlawful access, printing, copying, and downloading has become an important business requirement in today’s digital economy. In this presentation, the speaker will examine current best practices that global companies employ to protect against loss of customer data and intellectual property when stored on worker desktops and mobile systems, monitoring and controlling content without impeding the flow of business, and survey current content detection technologies that can provide visibility and control over confidential documents and records in order to inventory and track sensitive data in real-time.

Attendees will learn:
  • How to discover, lockdown and eliminate information leakage before it occurs based on content, user, system or drive
  • Using alerts to help train users on corporate policy at the time of the event to increase user awareness and reduce the risk of loss without impeding the flow of business
  • Achieving continual protection of laptop content even when disconnected while controlling access, saving, downloading, printing, copying, and moving of data
  • How to establish sophisticated custom monitoring categories to look for activity and information that is uniquely company-specific or proprietary to the nature of your business.

    Wednesday, March 7th
    5:30 pm - 6:30 pm

  • Digital Fingerprints - Bonnie O'Neil, PPC; John Murphy, John Alton Murphy, Inc.
Computers used in the commission of a crime often leave a trail of evidence that is entrapped in the data stores, archives and mail boxes of the corporation. The challenge is not so much in their storage as in their recovery. Litigation has repeatedly pointed to the need for not only “storing” data but for having effective and timely mechanisms for its recovery. The delivery of the “facts” without the context provided by metadata can result in bad decision making and errant judgments. Metadata, when properly used, can significantly reduce the resources, time and costs required for the e-discovery process.

This presentation includes:
  • Several strategies related to forensic electronic discovery (e-discovery)
  • Standards in metadata for documents, e-mail, presentations, web content as well as relational data and blogging content.
  • Specific examples of metamodel content as well as real world cases
  • The construction of a governance and compliance metamodel to contain specific regulatory and potential litigation defense information
  • Metadata for data retention: used to track data lifecycle and to enforce specific retention periods and authorities associated with its management and disposal

Do you have a need to integrate data from multiple different structured and unstructured sources? Are you concerned about duplicate data and which records are accurate? These are just some of the challenges that face IT professionals and project managers who work on systems integration initiatives on a daily basis. When integrating data from multiple systems, sometimes containing structured and unstructured data, two critical components of Data Cleansing emerge; Entity Resolution and Entity Extraction.

Entity Resolution is a form of Data Cleansing and is better known as the “de-duplication” of data or more accurately the process of identifying and linking records together that could be the same entity. Entity Resolution is generally performed on data, formatted in fixed fields, and residing in a structured format.

Entity Extraction is a form of Data Cleansing used during Data Integration specifically focusing on unstructured data. Sometimes referred to as “Text Mining” or “Information Extraction”, Entity Extraction is the process by which unstructured data in files like word documents, email, and PDF files can be searched and given meaning from the body of text.
  • Entity Resolution & Entity Extraction defined
  • Data Integration Pillars
  • Entity Resolution
  • Standardization
  • Matching
  • Survivorship
  • Entity Extraction
  • Business Need
  • Benefits of Entity Resolution & Entity Extraction
  • Entity Resolution Case Study (FDIC CAS)

The OMG’s Business Process Modeling notation, BPMN, was designed to be intuitive and easy-to-use for to business and non-technical users, while supporting robust and detailed semantics for technical users. This session will provide an introduction to the BPMN: its history, purpose, an overview of the notation, and hands-on examples. Tutorial attendees can expect to have a basic working knowledge of the notation by the end of this session.

Session content includes:
  1. What the BPMN was designed to do (and not do)
  2. Notation Overview
  3. Hands-On Examples
    • “Business-Focused” modeling
    • Designing models for execution
    • Where BPEL Fits in
  4. Where do we go from here
    • Implementing Business Process Modeling in your organization
    • Industry Vertical templates and examples
    • How Process Modeling aligns with other initiatives (Data Modeling, Enterprise Architecture, BPM Suites etc.)

A number of key innovations in web-based technologies have come to be known collectively as “Web 2.0.” These innovations offer small organizations, consultants, and even individual knowledge workers, new ways to communicate with customers, and improve the performance of their businesses. Through case studies, demonstrations and discussions, this workshop will demonstrate the big ideas and new capacities behind each of these new Web 2.0 technologies, how to be successful with them, and what obstacles to avoid. You’ll learn how to apply the ideas to your own organizational and/or departmental marketing and operational needs.
  • New forms of online presence
  • The New New Collaboration
  • Personal Information Management and the New Desktop
  • Podcasting for Learning
  • Implementing Web 2.0

Close Window

  Wishire Conferences DAMA International