Conference Sessions
Work in Progress - Preliminary Program


Tuesday Afternoon Sessions

KEYNOTE: DATABASE and SOFTWARE TRENDS

Chaired by:
Ken North
Consultant, Software Developer and Author


Panel of Experts:

  • Bob Bickel, Founder, Bickel Advisory Service
  • John Goodson, VP of Product Operations, DataDirect Technologies
  • Dean Guida, CEO, Infragistics
  • Geoff Brown, Technical Director, ATS Core Technologies, Oracle


Data Integration: Strategic and Business Perspectives

Katherine Hammer
President and CEO
Evolutionary Technologies

When Best Practices Aren't Good Enough
Katherine Hammer

 

One of the largest contributors to the general skepticism about technology is the failure to understand the complexity and importance of data integration management to every IT initiative. While there is a general understanding that data integration is important - to business intelligence, EAI, CRM, and so on - there is little attempt to treat data integration as an key factor in an effective enterprise architecture. One of the largest contributors to this failure is the process large organizations use to evaluate and purchase software, and this failure is a large part of why the cost of software ownership remains so high. This presentation analyzes these major problems with the way software evaluation is conducted and suggests how companies can use a few key criteria and a revised methodology to solve this problem.

  • Application consolidation
  • Cultural conflicts of interest
  • Limitations of industry analysts
  • Change management
  • Risk management

Service-Oriented Integration and Process

Ron Schmelzer
Founder and Senior Analyst
ZapThink, LLC


Integration is not about simply plugging two systems or organizations into each other. The vision of "plug and play" application and system integration is a pipe dream that may be appropriate for in the distant future, but right now enterprises face the more immediate challenge of connecting arbitrary systems in a manner that is cost effective, manageable, efficient and secure. Ron Schmelzer, senior analyst, ZapThink, gives you soup-to-nuts expertise for Web services and the Service-Oriented Architecture (SOA), as they represent an approach for integrating systems using an abstracted methodology called Service-Oriented Integration (SOI). Discover how Web services is becoming a key element to simplify and enable integration between legacy, heterogeneous and disparate systems. Gain real-world advice on:

  • The use of Web services for integration
  • How EAI and B2B integration are merging and what this means to you
  • Web services integration with mainframe and legacy applications
  • How SOI technologies and approaches solve lingering integration issues
  • Basic elements of SOI
  • Drivers and motivators for SOI adoption
  • Market segmentation
  • Key vendors and technologies in the SOI space

A Little Appreciation - Is it too much to ask for?

Moderated by Graeme Simsion

Panelists:

Graham Witt


Karen Lopez


Joe Maguire


Len Silverston


The burning issue for many modelers is to be relevant, needed and appreciated: it's a day-to-day issue in terms of winning arguments on approach and design at the project level, and a career issue in the sense of "is there a future for data modeling professionals?".

This panel looks at the practical issues facing the professional modeler (data or data and process) in today's world, and what this portends for the role of modelers in the future.

  • How do you convince the stakeholders of the importance of modeling?
  • How do you deal with disputes?
  • What happens in an agile project / package implementation / outsourced development?
  • What will be the keys to staying relevant and employed? How do the consultants do it? Are there secrets you can learn from?

Unstructured Data Management:
Concepts, Tools, Applications and Future Directions

David Raab, Partner, Raab Associates (moderator)

Barak Pridor, CEO, ClearForest

David Bayer, Stratify

Jose Colon, Vice President of Technology, Autonomy


Unstructured data compromises the majority of corporate information, but is often difficult or impossible to access. This session explores the types of unstructured data, the challenges of managing it, and the technologies available to help meet those challenges. The first part will present the concepts of unstructured data management, including ways of developing taxonomies to organize unstructured data, manual vs. automated techniques for assigning documents within a taxonomy, methods for assessing similarity between documents, user interfaces to access large unstructured data stores, and reasons that generic search engines are often inadequate. It will also describe requirements for common applications including text search, community identification, collaboration, alerting, link analysis and summarization.

The second part of the discussion will bring in 3 of the industry’s leading companies to talk about the tools that are currently available to manage unstructured data (including some demos), and where the technology will go next. This moderated discussion will explore the biggest challenges and opportunities in this nascent field, how to integrate unstructured sources with structured data, and where to look for the high-payoff applications.

  • Definition and importance of unstructured data (i.e., text, video, audio, etc.; I'd mostly focus on text)
  • Taxonomies and meta-tagging in managing unstructured data
  • Typical applications and their requirements
  • Technologies for text categorization and search/access
  • Discussion and classification of major vendors

Wednesday Sessions

UML for Database Design

Terry Quatrani
UML Evangelist
IBM


Finally database modelers can now reap the same rewards from using the Unified Modeling Language as application developers have been doing for years. Application and database modelers can now speak one language - UML. No longer are database analysts, modelers, and designers relegated to the tail end of the development lifecycle. Now database designers can participate from the inception of the project, helping shape those early decisions that often have a critical impact on the system’s data. Also, being able to link together the object and data models, and thereby improving the understanding of both, helps yield higher quality systems.

Attendees will learn about:

  • The basic UML diagrams
  • The UML profile for data modeling
  • The relationship between the UML diagrams and conceptual, logical, and physical data modeling
  • The benefits of each UML diagram to the database designer; How UML helps to jumpstart their database design.

To Federate or Consolidate: 10 Things to Consider Regarding Data Integration

Ho-Chun Ho
President
HoTech Corp


Despite our best intentions, it is the reality in modern enterprises that different parts of an organization use different systems to gather, store, and deliver critical data. Only by combining the information from various systems can an organization realize the full value of these disparate data sources. There are many mechanisms to integrate data: data layer integration, data access layer integration, application-specific solutions, application-integration frameworks, workflow or business process integration frameworks, digital libraries with portal-style integration, search-engine-oriented integration, data warehousing, and database federation. When architecting data in support of any of these integration options data architects often face a tough decision-- we need to choose between consolidating data and federating data. This presentation will share the insight including technological issues, organizational impact, as well as cost factors. Among other considerations, this presentation will address:

  • Basic principles and criteria for exceptions
  • Data quality and integrity
  • Transaction characteristics
  • Performance, currency, and availability
  • Abusive reuse of API's and middleware
  • Structured, semi-structured and unstructured contents
  • Ownership and turf wars
  • Actionable meta-data

XML, Your Data, and You

Evan Levy
Senior Partner
Baseline Consulting Group


There's lots of buzz about XML as a data management enabler, but XML is no silver bullet. In this session, Evan Levy covers where XML fits with your data management, data warehouse, or data integration strategy. He navigates the benefits and pitfalls of using XML as a data management solution, including:

  • Popular myths about XML
  • XML for moving data
  • XML for accessing data
  • XML and ETL
  • XML and XQuery
  • Key job roles and XML

Evan will provide a checklist of the real-world challenges XML faces in corporate IT environments. He will discuss real-life case studies of XML in action at companies (a case study of a bank, and a case study of a communications company) which have succeeded in leveraging it for bona-fide business benefit.


Data Profiling Technology
How to Recover Metadata and More

Jack Olson
CTO
Evoke Software


Most IT projects depend on accurate and complete metadata for successful completion. However, this is rarely available. This presentation describes how the emerging hot technology, data profiling, is used to recover metadata from the data and provide other information such as information on data content and quality. Learn from Jack Olson - the person responsible for creating the data profiling space - about:

  • How to extract attribute properties from data
  • How to extract structural relationships from data
  • How to formulate and validate data rules from data
  • How to locate data quality accuracy problems in the data
  • Why data profiling technology can deliver what other techniques cannot

Active Metadata

Adrienne Tannenbaum
President
Database Design Solutions


Organizing data across the enterprise has always been a challenge. Today we are also deciding whether or not to organize metadata across the enterprise. With so much metadata and so little time many metadata solution efforts involve an assessment of all existing metadata's accuracy and usefulness. When discrepancies are identified, subsequent metadata solutions are required not only to represent updated and accurate metadata, but also to ensure its livelihood. Active metadata is always up-to-date since it is not a copy or redundant version of that information required to identify, define, locate, source, and access the associated instance data. When all metadata is active, metadata solutions are always current and useful. In many scenarios "active metadata" is created automatically when we create whatever is being described. From that point forward the metadata is always as functional as it can be. Implementing a metadata solution within the boundaries of today's corporate information environment requires active metadata functionality. Adrienne Tannenbaum will explain the role a metadata solution should play in delivering and maintaining active metadata. She will contrast active vs. passive metadata connections, illustrate various "active" architectures, and discuss the ideal approach to developing an active metadata solution.


Name/Address Matching and Consolidation

David Raab
Partner
Raab Associates


Name and address matching has become a huge priority in recent years, in a wide variety of contexts, from CRM systems to fraud detection, and from data quality applications to identity profiling for homeland security. In this session, industry expert David Raab provides an overview of the issues and technologies in identity matching. He'll discuss the different challenges of US and foreign data, provide an introduction to the tools that are available in the market today, and offer tips on testing the alternatives.

  • Overview of name/address matching process and issues
  • Competing technical approaches with strengths and weaknesses
  • Dealing with non-U.S. data
  • Brief discussion & classification of major vendors
  • Testing alternative systems

A Comparison of UML Class Models and ERDs for Data Modeling

Paul Dorsey
President
Dulcian, Inc.


It has been suggested that UML class models cannot be used for data modeling. Some recognize that it can be done but should not be done. UML class diagrams have not traditionally been used for data modeling but have mainly been used for object-oriented database design. This presentation will demonstrate how, with some natural extensions, UML class diagrams can be used very effectively to create a modeling environment that is superior to ERDs.

A number of extensions to UML class diagrams, including primary key specification, History and Audit keywords, display functions and derived attributes will be discussed. Specific examples where UML representations are clearer and more precise than their ERD counterparts will be shown.

Specific topics to be discussed in this presentation include:

  • UML Advantages:
    - Aggregation and composition
    - Generalization
    - Cardinality
    - Stereotypes, constraints and keywords
    - Interfaces
  • Extensions to support data modeling:
    - Data modeling
    - Logical primary keys
    - Audit
    - Display functions
    - Derived attributes
    - Domains (value list, validation)
  • Examples:
    - Many-to-many
    - Reference tables
    - Cardinality
    - Use of Interfaces
    - Use of derived attributes
    - History and audit

Speaker Note: In 1998, I wrote what was probably the first book on using UML for data modeling. I recognize that using UML for data modeling is a very tough problem. But now that I exclusively model using UML, I must conclude that it really is better than ERDs. Of course, the factors that make it a better method are largely the extensions used to help the data modeling environment. Those same extensions could have been added to ERDs, but generalization, methods, aggregation, composition, and association relationships are all places where UML has a native advantage over ERDs.


Exploring the Converging Worlds of Data Integration Tools

Faisal Shah
Chief Technology Officer
Knightsbridge Solutions


ETL … EAI … Web services … Do these product categories even exist any more? They do right now, but Faisal Shah will explore the future of these tools and how organizations in need of data integration should change their thinking before selecting a specific tool or tool category. While the categories are experiencing convergence among features, he will also point out significant breakout features among select products that make them leaders in their category. Faisal will walk through features you should consider when making a data integration tool decision, and how and organization should evaluate their current and future feature needs and try to match the appropriate tool that will provide the best long-term investment.

  • How data integration tool categories will evolve
  • How to determine your organizational data integration tool needs
  • How to select the appropriate integration tool
  • How to ensure the best long-term integration tool decision

XML: A Component of Metadata and Enterprise Architecture

Ben Jenkins
Senior Architect
BellSouth


This session focuses on how the Extensible Markup Language (XML) applies to metadata, and offers an overview of one real-life implementation of an enterprise XML repository. Participants will learn the fundamental segments of XML architecture. As the universal language for data on the web, XML gives you the power to deliver unambiguous data meaning between business applications, search capabilities, and static/dynamic presentation. The Metadata Services Group within BellSouth has spent the last 4 years developing an Enterprise Metadata architecture solution which includes the pervasive role of XML. This conference will develop the attendees understanding of XML and how it has been used in a successful enterprise metadata implementation.

  • The fundamentals of XML
  • Applying the principles of XML to metadata
  • What’s in it for metadata: RDF, Dublin Core
  • Is XML all there is to an enterprise metadata architecture?
  • A successful implementation of an enterprise XML repository

How to Build a Great Relationship with your Data using Profiling and Quality Assessment Techniques

Brad Darrach
Information Quality Analyst, IT Business Systems Support
Fallon Community Health Plan

 

The honeymoon is over; now you’re married to your data. Unfortunately, it may have been an arranged relationship and you didn’t have much time to get acquainted first. Here is a session in “speed-dating” to help you develop a strategy to get to know your data quickly and efficiently - before some hidden personality disorder ruins your life! There are some fundamentals that we will show you to help you reveal some of the mysteries in a hurry and empower you to decide what should happen next. Then, we will show you how to monitor your relationship and watch for signs of trouble and keep you out of counseling. Brad will share his successes and challenges to help you target your next project for success. He will discuss:

  • Metadata issues – critical for integrating data
  • Defining data quality – establishing appropriate metrics
  • Conversion testing vs. ongoing maintenance
  • How to let your database work for you – sample queries and scripts
  • Empowering your customers to help

Active Metadata
Discussion and Vendor Roundtable

Andrew Manby
Director of Platform Product Management
Ascential Software

Greg Blumstein
President
Data Advantage Group

Naresh Govindaraj
Senior Product Manager for Metadata and Interoperability
Informatica


Adrienne Tannenbaum's presentation will be followed by an interactive vendor panel. Invited metadata solution vendors will discuss their products within an "active metadata" limelight. Attendees will then be given an opportunity to present specific questions to one or more vendors. Adrienne will serve as the mediator of this interactive discussion.


New Directions in Record Matching and Relationship Linkage

David Raab, Raab Associates (moderator)

 

Frank Dravis, Vice President Information Quality, FirstLogic, Inc.

Ramesh Menon, VP, North America, Search Software America

 

Andrew Borthwick, President & CEO, ChoiceMaker Technologies, Inc.

 

 

The "Know Your Customer" movement is driving business strategies that need "total customer knowledge" or the "360 degree view". From an IT perspective, identifying the relationships between customers, contacts, organizations, consumers, households etc. across multiple systems and databases becomes a basic requirement of today's systems. And if you’ve been involved in systems or projects that need to integrate or merge such entities, you’ll know that it’s not enough to perform a simple string comparison of two records to find a relationship or a duplicate. This will uncover exact matches only.

There a lots of products on the market that enable relationship discovery, matching and linking. In this moderated discussion we’ll talk to three of the most prominent companies in the field to learn what the state of the art (or science) is in terms of the technological capabilities of products, and what they see their customers doing in terms of best practices, methods and processes.

  • Is a single strategy for finding matches enough? How do you know what you’re missing?
  • What’s the downside/upside to using multiple matching programs?
  • Discovery processes
  • To merge/purge or “link”
  • The problem of approximate matching, and strategies to deal with it.
  • Strategies to measure and maximize quality
  • Measuring record linkage and deduplication accuracy
  • Methodologies for determining the extent of record duplication within a database
  • Measuring the accuracy of a record matching solution, including false matches


Integrating Data with Processes: Data Model-Driven Applications using CASE

Jose Borja
Senior Analyst Programmer
Mayo Foundation


Are you tired of developing detailed data models that end up collecting dust after technical design and are never used again? Then join me and learn about data model driven applications and the CASE tools that make it possible to permanently glue your data model into the processes and procedures that make up an application.

Witness proof of the importance to use tools that include the data model in the application and learn how CASE tool environments have a leg up in providing automated solutions to enterprise data integration issues, database management procedures, and impact analysis procedures.

The session uses live demonstrations of an CASE generated Client/Server application to cover these topics:

  • CASE environment and the Encyclopedia concept
  • Data Model development in an CASE environment
  • Marrying the Data Model into the processes that implement the business rules
  • Logical to Physical model transformation
  • Impact Analysis (change Management)
  • Managing denormalization in CASE. Implementing a data model change, modifying the processes, and regenerating the application in minutes not days!

Speaker Comment: I will present in a vendor neutral environment how CASE tools help solve issues on enterprise data integration, data and process management, modeling, and model contents analysis. I believe participants will love seeing how their Data Models can become a live part of their applications and not just an analysis phase deliverable and database schema for DDL.


Enterprise Common Data Architecture - Roadmap to Integration

Daniel Paolini
Director, Data Management Services
State of NJ Office of Information Technology


An Enterprise Common Data Architecture (ECDA) is a collection of related tools and technologies, along with standards and policies and the methodology and the expertise to employ them. The architecture enables real-time operational integration as well as the delivery of integrated analytical data to different communities in the format that each requires.

The creation of an ECDA is a major yet essential commitment to any long-term strategic initiative to support data reusability. This architecture forms the foundation for collecting, storing, managing and controlling privacy of and access to data on an enterprise basis.

Data reusability is the process of collecting, managing and storing electronic data in all forms and formats to promote the efficient use of that data between business units and applications while minimizing data redundancy. It is more specific than data sharing, in that data sharing can involve data duplication, while the goal of data reusability is to eliminate unnecessary duplication. It is more specific than data integration, in that data integration is the melding of information from disparate systems, while the goal of data reusability is to integrate while optimizing and standardizing the common data.

The presentation discusses a generic ECDA based upon the New Jersey Common Data Architecture being deployed by the Office of Information Technology:

  • The problems and challenges of a legacy environment
  • The opposing needs of operational and analytical systems
  • The process to create integrated operational systems
  • The rational creation of analytical solutions
  • How to get IT staff, executive management and business users to see the big picture

Enterprise Information Integration (EII)
Emerging real-time solutions, standards and tools (including an introduction to XQuery)

Nitin Mangtani
Technical Program Manager
BEA Systems Inc


Information is locked in the islands of relational databases, EIS systems and proprietary applications. The need to integrate information in real-time has greatly increased with the advent of web and more focus on self-service based applications. This session talks about information integration challenge in building self-service portals, composite web services and business processes. We will look at traditional solutions of information integration such as ETL, EAI, custom coding and their pros and cons. Then we will talk about emerging solution called Enterprise Information Integration (EII). This discussion will cover basics of emerging W3C standard XQuery and the need for native query language for XML. We will discuss how developers can use XQuery to create reusable views of enterprise information. Finally we end this session by covering a real world example of applying XQuery to deliver rapid information integration solution.

  • Information Integration challenge
  • Business requires aggregated access to real time information
  • Data Sources are heterogeneous
  • Traditional information integration solutions – ETL, EAI and custom coding
  • What is EII?
  • XQuery Basics
  • Real world example and solution using EII technology

Enterprise Data Maturity

Burton Parker
Principal
Paladin Integration Engineering


Peter Aiken
Founding Principal
Institute for Data Research/VCU


This presentation will summarize results of research into the state of data management maturity in 200+ large commercial and non-profit organizations, and government agencies. The presentation will include a brief review of the Enterprise Data Management Framework that underpins the research efforts. The Framework consists of a Template for Enterprise Data Management, including roles and responsibilities, and a Template for Enterprise Data Management Maturity. Profiles of the research results into the state of data management maturity will be summarized in a number of matrixes based upon the Templates of the Enterprise Data Management Framework. The presentation will also include summaries of specific commonly found findings as reported in a number of data management process improvement engagements. Attendees at this presentation will understand:

  • The elements of enterprise data management maturity.
  • The results of Virginia Commonwealth University’s research results on enterprise data management maturity.
  • Summaries of selected enterprise data management maturity evaluations.
  • Specific comments commonly heard from participants in enterprise data management maturity evaluations.

Semantics in Business Systems

Dave McComb
President
Semantic Arts


Long marginalized, Semantics is emerging as the trump card in application development and integration projects. Getting ready to normalize your data model? How do you know you’re normalizing the right data? Getting ready to integrate two systems? How do you know what the data definitions really mean?
Whether addressed early or late in a project, the meaning of the information will eventually assert itself and need to be dealt with. This presentation is an introduction to Semantics and its application to business systems. It will be a cook’s tour of how Semantics permeates these areas of development:

  • Conceptual Design and Semantic Modeling
  • XML and the definition of meaningful schema
  • Business Rules and the advantages of basing them on an semantically rich ontology
  • Ontologies: what are they anyway?
  • Systems Integration and Semantic Brokers
  • The Semantic Web

Quantifying the Operational and Financial Risks in Enterprise Data Architectures

Chito Jovellanos
President & CEO
forward look, inc.


How did the banking industry formally assign "economic value" to its data? The banking sector has recently adopted a new approach to measuring the contribution of operational data and systems to the fiscal health and performance of financial institutions at-large. The "Basel II Accord" is a major update to regulatory capital requirements that incorporates dramatic new provisions for quantifying the broad operational risks which financial institutions encounter.

This presentation highlights the evolution and deployment of the Basel II Accord, with particular attention to the business-critical data and systems that these technology driven firms rely on. More importantly, this session offers insights and techniques that will enable data architects and information systems managers from a broad spectrum of industries to:

  • Use simple methods to identify and measure the potential risks and financial liabilities inherent in a company's data architecture and supporting systems
  • Communicate effectively about data management and operational risks in terms that a CFO can appreciate and use
  • Apply these methods to other initiatives such as BCP (Business Continuity Planning) and DRM (Disaster Recovery and Management).

A worksheet highlighting the application of the techniques discussed in this session will also be presented.


Data Modeling: A Developer's View

Roland Berg
Principal Consultant
ThinkSpark


How often have you created an elegant model only to find out that the developers couldn't understand how to use it? The initial thought is frequently “What’s wrong with the developer? This is simple!” Unfortunately the problem isn’t always the developer. Our experience has shown that the problem is often related to inadequate documentation and/or communication of what the structures really represent.

This presentation looks at data modeling from the developer's perspective exposing the data modeler and architect to the issues and concerns of the developer. It provides guidelines for modelers and architects to help them communicate appropriate information to the development team so that they may design and write their code as efficiently and effectively as possible. The approach presented involves training and coaching strategies as well as identifying an appropriate type of documentation for the other members of the team.

Once the developers have an appropriate level of information they frequently find enjoyment in using the structures we have defined. The result is a better and more effective working relationship between the modeler-architect and the designer-developer.


Supporting Multiple Views of Enterprise Information
(or...Get Out of My Life, but first could you drive me and Cheryl to the mall?)

Martin Dunn
Vice President
Alliance Consulting


You may not need teenage children to relate to this humorous book title. If you have ever undertaken an enterprise integration project, you may share a similar relationship with your application owners.

In the politically charged waters of enterprise integration, it is often necessary to support multiple views of reality. Creating a "single view of ..." has become a familiar mantra of integration technology vendors. By definition, there can be only one "single version of the truth" but recognizing, understanding, managing and supporting differences of opinion is a key aspect of any integration project.

In this presentation we discuss data management techniques and processes that arise from supporting multiple views of enterprise information. We will review some successful techniques developed to cater for this difference of opinion and discuss the implications of these solutions.


XML Vs. Relational - The Top Ten Differences

Jim Stewart
Director of Consulting
ASIX, Inc.


This presentation explains the fundamentals that differentiate XML and Relational technology. Although the differences between XML and relational seem obvious, the problem of selecting the right one it is not that simple when you take a closer look. Reading articles and attending seminars shows that even the industry “experts” have basic misunderstandings. Arguments for and against the use of the two approaches show confusion about the best uses for each, how they are different or similar, and apparently conflicting claims in several areas. The advent of XML DBMSs makes the issue even more confusing.

This presentation clearly identifies the top differences (and similarities) between the two technologies in a way that will help data professionals understand them, select the correct solution for an application and make sound architectural decisions on their use.

The Top Ten “differences” that will be reviewed are:

  • Primary Purpose
  • Data Paradigm
  • Metadata Approach
  • Ease of Understanding
  • Relationships
  • Object Identification and Navigation
  • Query Ability
  • Flexibility
  • Data Integrity
  • Performance.

The Structural Integrity of Source Data

Joseph Novella
President
Anthem Consulting, LLC


Many data profiling and assessment efforts tend to focus on domain studies tasks and techniques. However, understanding the content and scope of individual columns and fields is only the first step in developing a complete data assessment. This session introduces techniques used to evaluate the structural integrity of source data.

  • Functional dependencies
  • Evaluation techniques
  • Normalizing non-relational data
  • Effective sampling techniques
  • Data quality error or embedded table?
  • Products on the market

Positioning Metadata as a Business Asset
(and get funding for your project)

Daniel Riehle
Principal
GetReals Inc.


Metadata is seen as a technical necessity, not as a business asset. This is a strategic error on the part of IT organizations. The reality is that funding for Metadata will never be significant if you can not morph metadata into a enterprise-wide business asset. This presentation shows how to use an extensible repository, COM objects, XML and ASP to generate self-guidance of business people through the mountains of technical metadata inherent in your IT organization. As an example of the technique, the presentation relies on the need to reflect “Lineage”, the journey that business-valued information takes from first source (Legacy systems) through final targets (BI Tools).

To properly position metadata as a business asset (and hence secure funding for the IT effort), you must make the requisite detailed metadata transparent to the business user yet keep these users’ need foremost in the overall architecture of the enterprise metadata. It is a sneaky detail that by focusing on the business value of metadata, you accumulate the technical metadata required to streamline the activities of IT.

The underlying technology of this presentation is Teradata’s Metadata Services but the concepts and techniques are universally applicable to any extensible repository engine. Key take-away points are:

  • Extending the repository to reflect business needs
  • Data acquisition and parsing of metadata sources (lineage and transform acquisition)
  • Summarization of lineage
  • Applicability to other enterprise metadata tasks
  • Business and technical funding points (budgeting).


A Strategic Reference and Master Data Framework for Enterprise Information Delivery

Kapil Khanduja
Practice Director, Data Management
Data Foundations

Problems of information disparity have plagued the IT industry since its beginning: The inability to integrate data across business units, inconsistent business definitions in the organization, the inflexibility of applications to respond quickly and cost-effectively to business changes; all these problems result in incomplete and/or inaccurate business reporting.

Current solutions include point-to-point mapping, transformation, cleansing, build/buy integrated transactional systems and custom applications to centralize the maintenance of master files. But the “strategic” approach to this problem involves the creation of a foundational enterprise information layer for the business – a visible shareable reference data layer –that enables an “information-aware” organization.

Certainly some challenges exist in terms of getting consistent definitions, identifying business ownership, change management, transitions; regional standards, and deployment throughout the organization. But overcoming the challenges is possible using a methodological roadmap, and using tools to assist the transition and implementation; standards and process changes.

The presentation will be of interest to attendees involved in enterprise information delivery, data models, architecture, metadata, ETL, data deployment and reference/master data management.


Role Transformation: Data Analyst and Architect

Jane Carbone
Partner
infomajic, llc



Should the work of data analysts and architects remain forever separate, and if so, how can we ensure they communicate to best serve the enterprise today? This presentation looks at data integration from the organizational perspective and proposes some changes. The speaker evaluates the traditional roles of data analyst and architect and describes why change is necessary. It discusses how to effect change by proposing new model role descriptions. The presentation identifies gaps in current skills and knowledge and suggests how to address them. Finally, it evaluates organization structures that facilitate change.

- Traditional roles and where they break down
- Defining new roles
- Identifying gaps in skills and knowledge
- Overcoming gaps and barriers
- Positioning roles in the organization structure

Speaker Comment: We get lots of questions on people issues, and believe change needs to be addressed here as well as in technology.


Why Data Marts Proliferate: Business Semantics and Data Integration in Conflict

Robert Klopp
Director
Skyland Technologies


In most Business Intelligence implementations, data marts proliferate, creating a data management headache. Often this proliferation occurs despite the availability of a comprehensive, integrated, data warehouse.

In this session the author will describe modern data warehouse architecture, with dependent data marts, in terms of business semantics. The differing semantics inherent in operational, warehouse, and mart schemas will be introduced; data integration costs will be described in terms of semantic transformations; the costs will expose a flaw in current data warehouse architecture; and the proliferation of data marts will be explained as the result of that flaw. Finally, the application of federated database technology to the problem of data mart management will suggested as a way out.


The Role and ROI of Enterprise Schema Management

Jeff Dirks
CEO
Schemalogic


Enterprise Schema Management is a new concept driven by the need to improve information sharing, cut costs and increase responsiveness to changing requirements. Involving repository technology and a highly collaborative process, success requires leadership from the IT and business architects who establish best practices. With increasing usage of XML Schema, now is the time to treat schemas as an enterprise asset to encourage re-use and ensure interoperability.

In this session we will review the role and ROI of Enterprise Schema Management to help attendees understand, rank and financially justify this critical aspect of information integration. Attendees will learn the key aspects of Enterprise Schema Management, which include creating an enterprise schema repository, remote collaboration, balancing corporate standards with a natural diversity of viewpoints, change management, schema search and synchronization processes.

Based on real-world case-studies describing information integration within manufacturing, financial services and media companies, this presentation will describe best practices including the leadership, politics and process challenges that can impact success even more than any given technology. Most importantly, attendees will understand the key factors that produce business value and return-on-investment from Enterprise Schema Management

Attendees will leave this session knowing:

  • How does Enterprise Schema Management enable reuse of XML Schema?
  • Are enterprise-wide, cross-application schema standards possible?
  • The downside of freedom: chaos as XML developers create arbitrary schemas
  • How have pioneers managed the politics and process of schema stewardship
  • Why management cares: financial benefits and ROI

Speaker Comment: Analysts and press are starting to write about how the proliferation of "arbitary" XML Schema is getting out of control. At recent conferences a number of people have come looking for a way to manage schema in general (gather, reconcile, search,...) and often triggered by the explosion of XML Schema.


Data Quality Assessment & Measurement:
Developing Data Quality Process Measures

Shaun Williams
Data Integrity Manager
H-E-B Grocery Company

Joan Brooks
EDI/Data Integrity Program Owner
H-E-B Grocery Company


Ranked by Forbes Magazine as the 9th largest private employer, H-E-B Grocery Company has over 300 retail outlets in Texas and Mexico, 55,000 employees, and close to 10 billion dollars in annual revenues. H-E-B’s systems process utilizes hundreds of information systems, many of which are legacy systems existing on antiquated platforms with little or no “on-line” data quality checks (edits).

In 2001, the senior leadership team at H-E-B formed the Data Integrity Group with these primary objectives:

  1. Assess the data quality issues in H-E-B’s information systems, cleanse data issues, and help position the company for migration to more “modern day” applications.
  2. Design and integrate data integrity as a way of life at H-E-B, through the use of data stewardship principles including process reviews, training, and measurements.

The Data Integrity department has spent the last 1½ years assessing and cleansing data that will be integrated with key Merchandising and Supply Chain applications. In addition, the group has developed a methodology, process, and technology for tracking data quality exceptions within a given process, and assigning cost metrics to these exceptions.

This presentation will discuss the processes, methodologies, tools and measurements implemented by H-E-B’s Data Integrity department in order to improve and sustain data quality, by discussion of the following topics:

  • Definition of a process from a data quality perspective
  • Identifying critical data within a process
  • Engaging the business
  • Defining and selecting the process measures
  • Establishing costs and goals
  • Tools and technology for measurement and monitoring
  • Resolving data quality issues
  • Embedding data quality into the project management process
  • Implementing data stewardship and designing data quality into business processes, job descriptions, and training

Metadata-Driven On-demand Data Integration

Patricia Klauer
CEO
Eclipse Data Systems, Inc.


Dina Bitton
Chief Technical Officer
,Xtegra Corporation


Businesses, like cities, grow organically. Streets are added in response to growth, rather than planning, in the same manner that businesses add databases and applications. For the most part, growth is unmanaged until reaching a certain critical mass. At that point, it becomes obvious that additional growth requires infrastructure management. To successfully manage infrastructure growth, one must not only plan for the future, but also integrate the past.

Growing businesses have the advantage over cities in that hardware and software is easier to manipulate than streets and bridges. But even then, it is a non-trivial problem, given that we are still surrounded by decades of legacy data and systems. We will describe a metadata-driven data integration infrastructure that helps to preserve the legacy code, while allowing the organization to move forward with new uses for the data. This approach not only solves present problems of data integration such as data migration and data mart proliferation, but also provides a platform for future growth and changing business requirements.

The authors will address key issues such as:

  • Persistently maintained data location and identification
  • Standardized conceptual schemas
  • Data transformation
  • Preservation of local ownership and access controls

Speaker Comments: This is not a conceptual solution. This presentation describes new technology that is now available to support metadata driven real-time data integration.


Scientific Data - Challenges and Solutions

Olga Brazhnik
Chief Database Architect
Synchronous Knowledge, Inc.


Modern life profoundly relies on scientific knowledge and technologies build upon it. Scientific knowledge is acquired through the collecting and the complex processing of enormous amount of data, often measuring in terabytes or even petabytes. What is the process of extracting reliable knowledge from such an abundance of data? What are the challenges in the world of scientific data management?

Business and scientific databases are siblings but far from being twins; they face quite distinct challenges and develop divergent solutions, which they may share, albeit with some caution. In science, for example, there are significantly different ‘business rules’ depending on which scale applies, e.g. the result of mixing two chemicals cannot be calculated via simple algebra: distinct outcomes arise from different reaction conditions.

The presentation will cover data modeling, data processing, and data integration in science, emphasizing the complementary scientific solutions not found in the realm of business data. It will involve real life examples from the domain of bio-medical informatics, where the need for interdisciplinary data integration is the most pressing. This will be a lively, thought-provoking presentation and discussion of concepts that are new for business databases. Come with an open mind and learn about new challenges for database professionals in the realm of scientific data management.

  • Complexity of scientific data
  • Differences in processes of collecting and processing data in science and business
  • The goal and promise of data integration
  • Meta-data standards in life sciences
  • The use and limits of applicability of UML, object-oriented databases, XML
  • Open source development

Enterprise Semantic Models: Buy, Borrow, or Build?

Eli Israel
Lead Modeler
Semantic World


What makes a good Semantic Model? How can I make use of resources that I already have? What are some best practices?

This session will describe an introduction to semantic models and the roles they can play in an enterprise. It presents the qualities that identify a good model and the organizational factors affecting a model's adoption. It then asks if available models from a specific industry or organization are suitable for the specific business task at hand.

Additional questions that will be answered include:

  • Which industries already have mature semantic models?
  • What types of organizations are best suited to adopting such models?
  • Which team members should be part of a model adoption team?
  • What is a realistic project plan for integrating such models into the business?

The relative strengths and weaknesses of using relational models, object models, and XML Schemas for this purpose will also be discussed. Finally, real-world examples of best and worst practices garnered from modeling and consulting experience with Fortune 100 companies will be demonstrated.


Lessons Learned from Delivering Real-time Marketing Dialogs into Existing Operational Environments

Kevin Cavanaugh
VP of Technology
Unica Corporation


Effective marketing requires judicious use of customer data and coordinated treatment logic across many different operational systems and touchpoints. Unique data and business rule challenges are introduced when distributed operational systems are coupled with the analytical marketing processing requirements needed to support real-time customer dialogs across these systems. This session explores system architectures and new data model requirements for coordinating outbound and inbound customer treatment strategies based on batch, real-time event-triggered marketing scenarios across different touch points. Case study examples will be used from recent deployments of real-time and dialog marketing deployments to highlight important lessons learned in their integration into existing high volume production environments.

  • Extended customer data model requirements to support cross-channel dialog marketing.
  • Architecture, system design and data integration strategies to support both synchronous and asynchronous real-time marketing application demands
  • Techniques for integrating coordinating real-time, scheduled event triggered marketing dialogs across touchpoints
  • Important lessons from recent deployments of real-time marketing in high-volume production environments.
  • Guidelines for establishing and prioritizing cross-channel dialogue based on business value metrics and implementation readiness/complexity.

Standards-Based XML Management - an Insurance Case Study

Senthil M. Kumar
Director, Product Management
Analytic Software and Decision-making Tools
Fair Isaac Corporation

Insurance carriers spend millions of dollars each year processing policy applications and claims submitted by brokers and agents. This document-centric business is ideally placed to exploit XML, for which the ACORD standard has been defined for the exchange of policy information across the insurance supply-chain. ACORD addresses the challenge of standardizing extensions to documents to accomodate rapid changes in business.

This session will focus on how an insurance carrier manages their XML documents with automated validation yet allowing for rapid development of extensions to their XML schemas, taking advantage of web services, business rules and predictive modeling.


How to Uncover the Truth behind your Data
(and deliver $30 million in benefits by rectifying data errors)

John Longley
Data Cleansing Manager
Ministry of Defence


Paul Nettle
Data Cleansing Manager
Ministry of Defence


September 2000. The challenge: Integrate inventory data for millions of line items held across the UK Armed Forces. The problem: No one knows how good (or bad) the data is. The solution: Create a dedicated team to tackle the issue. Thus The Cleansing Project (TCP) was born, a small team working within the UK Ministry of Defence (MoD), that has delivered in excess of $30M of benefits from identifying and rectifying data errors. Starting with nothing more than determination and a pot of money in September 2000, TCP has become centre of expertise on data quality within UK government. The team are now a key enabler for e-business initiatives and EAI within the MoD, and an enabler for collaborative working within the UK defense sector. This presentation will describe how any enterprise can reap similar rewards by adopting the TCP approach.

  • How to establish such a team and why the selection of its members is so important to success
  • How to find and secure help from within the business
  • How to select the right data profiling tools for the job and use them to best effect
  • How to get data errors rectified and monitor data quality
  • How to overcome the culture and other barriers that impede progress

Experiences with Meta-Data Management Across Tool Types

Christine Mandracchia
Manager - Data Administration
American ReInsurance Company


Effective meta-data management continues to be an undertaking requiring balance between the involved organizational roles and processes, and the capabilities of the specific tools available. In this presentation, Christine will describe certain characteristics of the meta-data environment in the organization prior to the acquisition of ETL, meta-data hub, and Business Intelligence products. She will discuss the subset of meta-data requirements to be supported for the business community, particularly the business names and definitions for the data warehouse contents. She will describe the "round trip" of meta-data from the data modeling tool, through a meta-data hub, to the ETL tool, and to the Business Intelligence reporting tool, and back.

The focus of the presentation will be on the issues that needed to be resolved in order to effectively execute this "round trip". These issues include which organization "owns" which type of meta-data, which tool is the "master" keeper of each type of meta-data, and how and when will the meta-data be moved among the tools under the various day-to-day situations of warehouse additions, updates, and deletions.


Open Source Data Warehousing and Databases

John Poole
Distinguished Software Engineer
Hyperion Solutions Corporation


The Open Source revolution is rapidly transforming the software industry in terms of both development practices and business models. Once regarded exclusively as the realm of software hobbyists, Open Source has become the software model of choice for many organizations. Data Warehousing and business intelligence (DW/BI) can benefit greatly from Open Source technologies, which enable both the construction of robust IT infrastructures, as well as an emerging class of software solutions for DW/BI, advanced analytics, and business performance management. This session provides a comprehensive overview of the Open Source contribution to DW/BI. Participants will learn why Open Source solutions enhance long-term, return-on-investment (ROI) of DW/BI projects, and provide corporate stakeholders with greater control over the maintenance and integration of their DW/BI solutions. An example case study in the use of Open Source technologies to implement a data warehouse is also presented.

  • Overview of Open Source
  • How key DW/BI concepts are addressed by Open Source solutions
  • Case study of a DW/BI solution based on Open Source technologies
  • Open Source support for platform, application, and Web services standards
  • Open Source support for open integration standards: OMG MDA, UML, CWM, meta data patterns, and popular repository and data models


Thursday Sessions

Modeling Business Rules

David Hay
President
Essential Strategies, Inc.


Other than terms, facts, and certain constraints, data models are limited in their ability to portray business rule constraints. Indeed, the more generalized a model becomes, the less it is able to show the business rules that constrain the domain being modeled. Where the topic of the model is itself rules, however (as in regulatory agencies), then the rules themselves can be modeled. In addition to showing the basic limitations of modeling to show constraints, this presentation will describe ways in which rules can be modeled, and will show examples in areas such a issuing permits, passing legislation, and so forth.


Corporate Compliance and Governance
Will New Legislation be the Killer App to an IT Recovery?

Karen Lopez
Principal Consultant
InfoAdvisors, Inc.


Who’d have imagined, a few years back, that IT would become a government-regulated environment? But now we have Sarbanes-Oxley and the USA Patriot Act and the Basel II Accord – to name just a few of the new laws that are encroaching on IT. Consider privacy regulation alone – there are over 220 bills covering information privacy pending in the US Congress as of July 2003. Throw in the new “don’t call me” list, European (and California) data protection, Gramm-Leach-Bliley, HIPAA, etc, and IT's role in corporate governance has forever changed.

It’s the ultimate good news/bad news story for data management. The good news is that we’re at the center of this new activity. The bad news is that we’re at the center of this new activity. It’s truly a double edged sword – and we don’t yet know which edge is sharper. But we had better handle things well, because the senior executives at our organizations are in most cases, either financially culpable or criminally liable, or both.

Lets start with the big picture on what all this new compliance requirements really means to data managers and how you can do your part to help your company meet its obligations.


Service Based Architectures - Defining the Issues for Data Professionals

Robert Abate
Principal Partner & CTO
Intellisys, Inc.

Ruben Tungol
Senior Architect
Novell Corporation

The Service Based Architecture [SBA] is the latest in unifying designs which, using industry standards and best practices, allows for complete business process and application integration. It documents corporate data, fosters the fast and real-time integration of legacy applications while providing for foundation for the future development of object based, n-tiered systems. This architecture was designed to allow for the replacement of legacy systems by removing pieces of legacy functionality while allowing for the simultaneous usage of both the legacy system and new applications.

We will review the development of the SBA, the technology terms used in this architecture, the foundation components, review the object-based Common Data Models [CDM] and data cleansing/integration issues, uncover the SBA components, explain how to phase in real-time messaging and show the follow-on benefits derived from the SBA.

SBA and its foundation architecture:

  • Development of the SBA architecture
  • Lessons learned from a number of different implementations
  • Review best-practices of SBA implementations
  • Demonstrate the benefits of it's implementation
  • Debate implementation issues such as: Standards (J2EE, SOAP), Web Services, etc.
  • Examples of fast implementations of batch (non-real time transfers) and the slower setup of messaging (real-time).

SBA Architecture components:

  • Layered (integration services, runtime management services, metadata services and security services)
  • Multi-faceted (includes development, testing and run-time environments, provides for completely scalable features and administration / monitoring of events)
  • Industry Standards ((J2EE, Corba, SOAP, XML) Best Practices in SBA implementations (OO Development, architecture centric, model driven, iterative and component oriented)

Implementing an Integrated Enterprise-Wide Database

Ulka Rodgers
President
eTransitions, Inc

Andrew George
Director, Application Development
Corporation Service Company

 

Many corporations have talked about implementing an integrated database for their enterprise applications. This presentation is about what happened in reality. The presenters will discuss some of the challenges of managing multiple development teams, data conversion of critical data such as customers from multiple legacy systems, and the kinds of hard management decisions posed along the way. Using examples of applications, we will discuss:

  • Data and code integration issues
  • How the database played a central role
  • How the organization and projects were structured to address the challenges
  • Practical suggestions on what works and what does not for attendees embarking on similar integration projects.

Best Practices in Business Intelligence for ERP & CRM Suites

Aaron Zornes
Senior VP, Enterprise Analytics
META Group


The wholesale “information supply chain” change-out of information feeder systems that occurs with enterprise resource planning (ERP) deployments is an opportunity to vastly improve an enterprises’ analytic capabilities and concurrently reduce shadow IT costs. Enterprises remain challenged by their ERP investments’ ability to access and analyze enterprise information for business decision making. Concurrently, Oracle, PeopleSoft, and SAP are focusing on the analytic applications market and challenging best-of-breed analytics vendors such as Business Objects and Cognos. Un-architected “BI for ERP solutions” remain a huge shadow IT cost yet prescient BI projects stand to offer “break away” competitive advantage.

  • Assessing the BI capabilities of mega application package vendors data warehouse offerings, e.g., Oracle OBIS, PeopleSoft EPM, and SAP BW
  • Applying the right mix of best-of-breed analytics to maximize the ROI of ERP and CRM investments
  • Superseding vendor-centric analytics via an enterprise-wide BI infrastructure strategy

Federal Enterprise Architecture: The Business Process Models

Michael Lang
Founder & EVP of Development
MetaMatrix


The Federal Enterprise Architecture (FEA) is a business-based framework constructed as a collection of interrelated "reference models" that will facilitate cross-agency analysis, and the identification of duplicative investments, gaps, and opportunities for collaboration among Federal agencies. The Reference Models act as a Target Architecture for which each agency will align. OMB and agencies will use the FEA for describing and analyzing information technology (IT) and other capital investments, and to improve Federal government service to the citizen. It includes a strong focus on delivering services to the citizen along with government- to- government process and information exchanges along with consolidating and integrating the services along lines of business.

In this presentation Michael Lang will present the theory brought forth in the Data Reference Model as well as discuss the enterprise architecture vision of the Business Reference Model. The Business Reference Model serves as the foundation for the other reference models - the Data Reference Model, Performance Reference Model, Application-Capabilities Reference Model, and the Technical Reference Model. The Information and Data Reference Model (DRM) is an approach and a Federated Information Model that can be populated along government Business Lines and be used across Federal, State, Local and International e-government initiatives. The approach is based on both sound information and data base theory, a serious need, and an approach that correlates with standards organizations to create an open and extendable family of information models. These models can be one element of each organization's push for information integration and increased consistency, commonality, and visibility.

This presentation will cover how government agencies are using data management, metadata management, Model-Driven Architectures, Real-Time Enterprise approaches, Service Oriented Architecture, Web services, information integration, and process integration.


Entities, Objects, and XML Schemas in One Model--Are You Crazy?

Richard Hecht
President
DATA Architects Technicians Analysts, Inc.


Is it possible that using different approaches in one modeling method can result in documenting and communicating data better than any one of these approaches by itself? This presentation explores a modeling methodology that combines the best of multiple approaches and furthermore maps the logical data to real physical implementations. This is not textbook and theory. This is actual practice and reality that arose from a need to help business users and developers better understand data and how it is implemented.

Specifically, this presentation addresses:

  • Graphical techniques to display model information
  • Approaches to present models to developers and users
  • Concrete examples from real development projects

The examples show how the synergism created from this novel approach helps produce schematics that improve the documentation and communication of enterprise data and its physical implementations. This presentation is not for the purist who religiously follows the one-method-can-fit-all-no-matter-what theory. This presentation is for those who are interested in new concepts and creative techniques. Applying these concepts and techniques can help produce meaningful and useful information to answer the questions developers and business users have about data and proves that combining multiple approaches may not be as crazy as it first appears.

Speaker Comment: This presentation may very likely be controversial as many data people are self-described “evangelists” for one modeling method or another. Some seem to border on religious fanaticism, so the very idea of combining multiple methods may ignite sparks. However, if our profession is going to progress, new concepts and creative techniques need to be explored, and conferences and forums like this one are ideal to present these new ideas. At the very least they can provide fuel for debate and cause people who are in a rut to possibly rethink and refresh.


Meeting the Requirements for Sarbanes Oxley
Understanding the Business Semantics and Data Quality Issues for Compliance

David Steinberg
Solution Director
Idea Integration

Peter Vink
Business Development Manager
Idea Integration

David Kotler
Dechert LLP

For many in the Business Intelligence community, there is a comprehensive understanding of the data; we understand from a technology perspective what infrastructures work and recognize that how the data is captured and purveyed will greatly impact data quality. In many cases we understand where the shortfalls are in our processes.

This presentation will take a look at the data requirements of Sarbanes Oxley and other recent regulatory rulings. In addition to exploring the ramifications of data quality and business semantics in light of this regulation, technology framework, approach alternatives and rationale for initiating these efforts will be discussed. Profiles of organizations that should be concerned with compliance will be highlighted.

For some organizations there may be a requirement to disclose deficiencies in the design or operation of internal controls within their compliance process. Because many companies rely on their enterprise data as the foundation of theses internal controls, there will be a growing need for the technical staff to understand the business ramifications of these initiatives.

  • Data Requirements for Sarbanes Oxley
  • Ramifications of data quality and business semantics for compliance
  • Technology framework to meet functionality
  • Profiles of organizations that should be compliant

Deriving a Data Model for Service Architectures

Peter Aiken
Founding Principal
Institute for Data Research/VCU


A metadata-based understanding is gained by a development process that applies eight transformations - organized into two phases - to each enterprise architecture/legacy system component. The eight transformations are applied in order to when effectively and efficiently developing an architectural component that is capable of delivering architectural and business engineering value. The transformations are illustrated. Transformations and other forms of data analysis occur using model refinement and validation (MR/V) sessions in conjunction with key subject matter expertise (SME).


Supporting Corporate Data Integration Projects

Melvin Jones
ETL Architect and Project Lead
T. Rowe Price Investment Technologies


This presentation will explore services, processes and procedures necessary to successfully support data integration projects operating in a shared environment using repository-based ETL tools. Those implementing a repository-based ETL tool for the first time will be interested in how to start off in the right direction to support multiple projects. Those who have already implemented such a tool can benefit from this presentation to enhance their support level of data integration projects.

  • The Data Integration Environment- What needs to be supported?
  • The data integration infrastructure
  • Service drivers- What drives the need for processes and procedures
  • Data integration support services defined: Standards, change management, metadata, stewardship, auditing, repository administration, pre-existing procedures
  • Roles and responsibilities defined: Project level, individual
  • When under the gun - Supporting projects in a high-pressure environment

Speaker Comment: This presentation is geared towards corporate practitioners in the data integraiton space, and reflects T. Rowe Price's real-world experiences and hands-on implementation with data integration architectures, process and procedures.


SAP Business Information Warehouse “Top 10 Pitfalls” – What Every DW Professional Must Know

Aaron Zornes
Senior VP, Enterprise Analytics
META Group


Enterprise analytics are increasingly integral to the strategies of both mega application package vendors (Oracle, PeopleSoft, SAP, Siebel) and the G2000 enterprises which deploy these packages. In this presentation, we also summarize future considerations for SAP’s Business Information Warehouse (SAP BW) and related analytic products.

An SAP BW implementation should be treated as a business project rather than an IT project. However, given the limited experience with SAP BW to date, users are well advised to stabilize their R/3 installations prior to putting their SAP BW into production – while benefiting from the lessons learned by other IT organizations. This presentation will cover these key lessons learned for SAP BW initiatives as well as the full “top10” list:

  • Provisioning for data quality and data completeness (all data sources)
  • Fine-tuning BW InfoCubes and Operational Data Stores
  • Understanding and leveraging the SAP BW architecture as part of the SAP R/3 rollout

Department of Defense Net-Centric Data Strategy:
The Right Information, to the Right People, in the Right Format, at the Right Time

Alan Perkins
ASG Federal

 

Alan Perkins will explain why he believes that the new "Net-Centric Data Strategy" mandated by the DoD Chief Information Officer, coupled with other DoD information management strategies, may be the solution that actually allows mission critical information to be delivered to the right people, in the right format, at the right time. He'll explain what the new requirements are, and how they relate to other new Federal IT initiatives. This session describes:

  • ·Department of Defense Architecture Framework (DoDAF)
  • ·C4ISR Architecture Data Model (CADM)
  • ·Defense Architecture Repository System (DARS)
  • ·Federal Enterprise Archicture (FEA)
  • ·Net-Centric Data Strategy
  • ·How they all relate
  • ·The potential and the pitfalls for each
  • ·How "industrial strength" metadata management makes it possible

3-D Knowledge Models

Henry Feinman
Information Architect
HJF Infosolve

The structure of Enterprise knowledge is complex – as complicated as any concrete product of our modern organizations. Other areas of endeavour have encountered the problem of complexity: Automotive design, structural architecture, computer chip design. In all these modelling domains the solution has been to use layers, perspective and 3-D CAD to build models that guide creation of these complex objects.

Firstly: Data – Structured and Unstructured, Business rules, Process, Entymologies – which of these belong to our modelling endeavours? What part of Enterprise knowledge is captured by our models, what part remains without?

Next: What alternatives have we for perspective and layering techniques: Data / Process split; Zachman’s enterprise framework; the traditional conceptual / logical / physical; abstraction. What are the strengths and weaknesses of different diagramming methods – ER, UML, ORM?

Finally: we will build sample layers using these techniques and then arrange an optimal set into a 3-D model that more fully renders and communicates the richness of Enterprise Knowledge.

We will step through the model using VRML, parts explosions, and other tools available to modern CAD modellers and see what insights bec