6th International Conference on
Enterprise Information Systems

ICEIS 2004
ABSTRACTS





 

 

 
 

Call For Papers
Doctoral Consortium
Program Committee
Keynote Lectures
Tutorials
Workshops

Accepted Papers
Paper Templates
Proceedings
Reviewers Only
Registration

Conference Program
Transport and Accomodation
Social Events
Local Information

Organizing Committee
Sponsors
Journals
Hall of Fame
Links

Organized by:

INSTICC

Co-organized by:


Universidade Portucalense






 



Area 1 - Databases and Information Systems Integration
Area 2 - Artificial Intelligence and Decision Support Systems
Area 3 - Information Systems Analysis and Specification
Area 4 - Software Agents and Internet Computing
Area 5 - Human-Computer Interaction


AREA 1 - Databases and Information Systems Integration

Title:

A RECONFIGURATION ALGORITHM FOR DISTRIBUTED COMPUTER NETWORKS

Author(s):

Chanan Glezer , Moshe Zviran

Abstract: This article presents an algorithmic reconfiguration model, combining mechanisms of load balancing and fault tolerance in order to increase utilization of computer resources in a distributed multi-server, multi-tasking environment. The model has been empirically tested in a network of computers controlling telecommunication hubs and is compared to previous efforts to address this challenge.

Title:

BVA+ - A BIT VECTORS ALGORITHM FOR ACCELERATING QUERIES IN MULTILEVEL SECURE DATABASES

Author(s):

Ramzi Haraty , Arda Zeitunlian

Abstract: Much research has been done in the area of multilevel database systems, especially in the security area and accelerating queries. In this paper, we present BVA+, which is based on bit vectors to accelerate queries in multilevel secure database systems. As its predecessor (BVA), the BVA+ algorithm follows the classic Sea View Model, but it recovers query output from single-level relations in a faster and more space-efficient manner than the previous works performed on this subject. In addition, the BVA+ algorithm does not produce spurious or extra tuples, which have always been a major problem in the area of multilevel secure database systems.

Title:

CONNECTIVITY OF ERP SYSTEM

Author(s):

Vatcharaporn Esichaikul

Abstract: The study is an attempt to propose the criteria for determining the appropriate connectivity of ERP systems. The result of this study provides a framework assisting ERP adopters in selecting integration approach which are appropriate to their needs. A survey was conducted to obtain information from ERP users to learn about their opinions on factors and criteria affecting connectivity of ERP systems. Findings from the study revealed that data oriented approach and application integration oriented approach are the most preferred integration methodologies. Opinions on criteria for evaluating ERP connectivity are nature of business process of organization, availability of technologies and service supports, nature of information system of organization, system flexibility, degree of integration, transaction volume, implementation cost, ease of maintenance, implementation time, security, and budget. Finally, the study proposes a framework to determine appropriate connectivity of ERP systems.

Title:

CONCEPTUAL MODEL FOR SOFTWARE FAULT LOCALIZATION

Author(s):

Abdallah Tubaishat

Abstract: Existing cognitive science and psychology studies suggest that a bi-level approach to fault localization is needed with both shallow and deep reasoning. This approach form the underpinnings for developing our Conceptual Model for Software Fault Localization (CMSFL) to aid programmers with the problem of software fault localization. Our CMSFL proposes that, during the fault localization process programmers build two mental models: an actual code model (the buggy code), and an expectation model (the correct code). A multi dimensional approach is suggested with both shallow and deep reasoning phases to enhance the probability of localizing many types of faults.

Title:

ASSESSING EFFORT PREDICTION MODELS FOR CORRECTIVE SOFTWARE MAINTENANCE - AN EMPIRICAL STUDY

Author(s):

Eugenio Pompella , Andrea De Lucia , Silvio Stefanucci

Abstract: We present an assessment of an empirical study aiming at building effort estimation models for corrective maintenance projects. We show results from the application of the prediction models to a new corrective maintenance project within the same enterprise and the same type of software systems used in a previous study. The data available for the new project are finer grained according to the indications devised in the first study. This allowed to improve the confidence in our previous empirical analysis by confirming most of the hypotheses made and to provide other useful indications to better understand the maintenance process of the company in a quantitative way.

Title:

SUPPORTING KNOWLEDGE REUSE DURING THE SOFTWARE MAINTENANCE PROCESS THROUGH AGENTS

Author(s):

Mario Piattini , Aurora Vizcaino

Abstract: Knowledge management has become an important topic as organisations wish to take advantage of the information that they produce and that can be brought to bear on important decisions. This work describes a system to manage and reuse the information (and knowledge) generated during the software maintenance process, which consumes a large part of the software lifecycle costs. The architecture of the system is formed of a set of agent communities. Each community manages different types of knowledge. The communities’ agents have the goal of encouraging the reuse of good solutions and taking advantage of information obtained from previous experience. In consequence, the software maintenance is made easier and there are less costs and effort. To achieve this goal, agents use several reasoning techniques such as case based reasoning or decision tree based algorithms which allow them to generate new knowledge from the information that they manage.

Title:

RETRO-DYNAMICS AND E-BUSINESS MODEL APPLICATION FOR DISTRIBUTED DATA MINING USING MOBILE AGENTS

Author(s):

MOHAMED  MEDHAT , EZENDU  ARIWA

Abstract: Distributed data mining (DDM) is the semi-automatic pattern extraction of distributed data sources. The next generation of the data mining studies will be distributed data mining for many reasons. First of all, most of the current used data mining techniques require all data to be resident in memory, i.e., the mining process must be done at the data source site. This is not feasible for the exponential growth of the data stored in organization(s) databases. Another important reason is that data is inherently distributed for fault tolerance purposes. DDM requires two main decisions about the DDM implementations: A distributed computation paradigm (message passing, RPC, mobile agents), and the used integration techniques (Knowledge probing, CDM) in order to aggregate and integrate the results of the various distributed data miners. Recently, the new distributed computation paradigm, which has been evolved as mobile agent is widely used. Mobile agent is a thread of control that can trigger the transfer of arbitrary code to a remote computer. Mobile agents paradigm has several advantages: Conserving bandwidth and reducing latencies. Also, complex, efficient and robust behaviours can be realized with surprisingly little code. Mobile agents can be used to support weak clients, allow robust remote interaction, and provide scalability. In this paper, we propose a new model that can benefit from the mobile agent paradigm to build an efficient DDM model. Since the size of the data to be migrated in the DDM process is huge, our model will overcome the communication bottleneck by using mobile agents paradigm. Our model divides the DDM process into several stages that can be done in parallel on different data sources: Preparation stage, data mining stage and knowledge integration stage. We also include a special section on how current e-business models can use our model to reinforce the decision support in the organization. A cost analysis in terms of time consumed by each minor process (communication or processing) is given to illustrate the overheads of this model and the other models.

Title:

IMPORTANT FACTORS IN ERP SYSTEMS IMPLEMENTATIONS

Author(s):

Piotr Soja

Abstract: In the article the problem of success factors in ERP systems implementations has been discussed. The review of the literature concerning success factors has been discussed and the collection of potential ERP implementation success factors was identified. Next, the result of research has been presented, where respondents have been asked about their opinion about the importance of subsequent factors for the implementation success. There were two groups of respondents: the first consisted of people from Polish enterprises implementing ERP systems and the second comprised experts working in ERP systems suppliers. On the basis of the research, the most important and necessary factors in the respondents’ opinions have been identified, as well as the least important ones.

Title:

IDENTIFYING CLONES IN DYNAMIC WEB SITES USING SIMILARITY THRESHOLDS

Author(s):

Giuseppe Scanniello , Andrea De Lucia , Genny Tortora

Abstract: We propose an approach to automatically detect duplicated pages in dynamic Web sites. Our approach analyzes both the page structure, implemented by specific sequences of HTML tags, and the displayed content. In addition, for each pair of dynamic pages we also consider the similarity degree of their scripting source code. The similarity degree of two pages is computed using different similarity metrics for the different parts of a web page based on the Levenshtein string edit distance. We have implemented a prototype to automate the clone detection process on web applications developed using JSP technology and used it to validate our approach in a case study.

Title:

INFORMATION TECHNOLOGY STRATEGIC PLANNING: ADAPTING FACTS AND BELIEFS TO BUSINESS STRATEGY GENERATION

Author(s):

Julio Bernardo Clempner Kerik , Agustín Francisco Gutiérrez Tornés

Abstract: This paper introduces a framework for adpating facts and beliefs to business strategy generation. The adaptation process model is supported by an information technology planning (ITSP) model and methodology. Tha aim of this paper is to validate the model. In the ITSP model, real world is composed by entities realated in terms of goals, beliefs, etc., through interaction they incorporate or refuse facts or beliefs related to the enviornment conditions. The adaptation concept is proposed to generate gusiness strategies. Two different methos are proposed: 1)an inference logic method, that emplys facts related to the enviornment conditions to generate new business strategies; and 2) case-based reasoning, a storred cases recorgind specific prior episodes, that induce the incorporation of business strategies. Both methods are presented. The adaptation process is presented through application examples.

Title:

ERP BASED BUSINESS PROCESS REENGINEERING IN A HUMAN RESOURCES DEPARTMENT: A CASE STUDY APPROACH

Author(s):

THEODORA CHATZIKALLIA , KONSTANTINOS  CHERTOURAS

Abstract: Modern organizations are constantly facing new challenges regarding the reengineering of their business departments and processes. By the term Business Process we mean the profile of specific methods that can be employed to perform specific business tasks. In general, each Business Process is uniquely tailored to the organization it applies. Therefore, the resolution of a Business Process related problem is typically carried out with custom methods developed within organizations. In this paper we propose the use of Enterprise Resource Planning (ERP) as the basis for reengineering a business department and effectively the Business Process that it carries through. We discuss the application of ERP in the reengineering of the Business Process of a real world organization department (a Human Resources Department), which lead to a significant productivity enhancement.

Title:

ORGANIZATIONAL AND TECHNOLOGICAL CRITICAL SUCCESS FACTORS BEHAVIOR ALONG THE ERP IMPLEMENTATION PHASES

Author(s):

Jose Esteves , joan pastor

Abstract: During the last years some researchers have studied the topic of critical success factors in ERP implementations. Up to this moment, there is not enough research on the management and operationalization of critical success factors within ERP implementation projects. The identification of factors leading to success or failure of ERP systems is an issue of increasing importance, since the number of organizations choosing the ERP path keeps growing. In this paper, we analyzed the evolution of organizational and technological factors along the ERP implementation phases. Our findings suggest that while both good organizational and technological perspectives are essential for a successful ERP implementation project, their importance shifts as the project moves through its lifecycle.

Title:

ACME-DB: AN ADAPTIVE CACHING MECHANISM USING MULTIPLE EXPERTS FOR DATABASE BUFFERS

Author(s):

Markus Kirchberg

Abstract: An adaptive caching algorithm, known as Adaptive Caching with Multiple Experts (ACME), has recently been presented in the field of web-caching. We explore the migration of ACME to the database caching environment. By integrating recently proposed database replacement policies into ACME's existing policy pool, an attempt is made to gauge ACME's ability to utilise newer methods of database caching. The results suggest that ACME is indeed well-suited to the database environment and performs as well as the best currently caching policy within its policy pool at any particular moment in its request stream. Although execution time increases by integrating more policies into ACME, the overall processing time improves drastically with erratic patterns of access, when compared to static policies.

Title:

EVALUATION OF A DOCUMENT DATABASE DESCRIPTION BY DIFFERENT XML SCHEMAS

Author(s):

Pierre Bazex , Madani Kenab , Tayeb Ould Braham

Abstract: Title : Evaluation of a Document Database Description by Different XML Schemas Authors : Madani Kenab (1,2), Tayeb Ould Braham (2), Pierre Bazex (1) (1) IRIT, 118, Route de Narbonne 31062 Toulouse, France { kenab@info.unilim.fr, bazex@irit.fr } (2) MSI, 83, Rue d'isle 87000 Limoges, France { ould@unilim.fr } Address contact: Tayeb Ould Braham Email: ould@unilim.fr Tel : 33 5 55 43 69 71 Fax : 33 5 55 43 69 77 Abstract A document database could be represented by different XML schemas, it depends on the content of the documents that it contains. From a simple conceptual schema of a database containing structured data that we represent in form of a document, we propose and evaluate different XML schemas describing this database in order to deduce the best one. For the building of these XML schemas we propose different descriptions of the key concepts of the relational model (relation, key and reference link) . We also propose the description of different nestings between the elements of the document (total nesting, partial nesting and without nesting) . We conclude that the best-adapted XML schema depends on the use that we wish to do with this database and it is a combination of the representation of different concepts. This work is a preliminary of the integration of a relational database thanks to the best XML schema. Key Words : Entity-Association, Relational Concepts, XML Schema, XML Document, Nesting Elements.

Title:

TRANSACTION DESIGN FOR DATABASES WITH HIGH PERFORMANCE AND AVAILABILITY

Author(s):

Lars  Frank

Abstract: When many concurrent transactions like ERP and E-commerce orders want to update the same stock records, long duration locking may reduce the availability of the locked data. Therefore, transactions are often designed without analyzing the consequences of loosing the traditional ACID (Atomicity, Consistency, Isolation and Durability) properties. In this paper, we will analyze how low isolation levels, optimistic concurrency control, short duration locks, and countermeasures against isolation anomalies can be used to design transactions for databases with high performance and availability. Long duration locks are defined as locks that are held until a transaction has been committed, i.e. the data of a record is locked from the first read to the last update of any data used by the transaction. This will decrease the availability of locked data for concurrent transactions, and, therefore, optimistic concurrency control and low isolation levels are often used. However, in systems with relatively many updates like ERP-systems and E-commerce systems, low isolation levels cannot solve the availability problem as all update locks must be exclusive. In such situations, we will recommend the use of short duration locks. Short duration locks are local locks that are released as soon as possible, i.e. data will for example not be locked across a dialog with the user. Normally, databases where only short duration locks are used do not have the traditional ACID properties as at least the isolation property is missing when locks are not hold across a dialog with the user. The problems caused by the missing ACID properties may be managed by using approximated ACID properties, i.e. from an application point of view the system should function as if all the traditional ACID properties had been implemented. Examples using E-commerce will illustrate how to use the transaction design recommended in this paper. We have cooperated with one of the major ERP software companies in designing our transaction model.

Title:

INCREMENTAL DATA QUALITY IN THE DATA WAREHOUSE

Author(s):

Karsten Boye Rasmussen

Abstract: The data warehouse is the cornerstone for the production of business knowledge in the organization. The foundation of the quality of the business knowledge is the quality of the data in the data warehouse. Determination of dimensions of data quality in the data warehouse has been obtained through the intuitive, the empirical and the ontological approaches. The first point of this working paper is that data quality is not a static measure and that awareness of the data quality dimensions is a prerequisite to improve the data quality. The second point is that selection is the cornerstone of data quality in the data warehouse in relation to the quality dimensions. Thirdly, that post-load improvement of the data quality is obtainable. Metadata can be added incrementally containing information on the use of data – and thus the users' selections within the data warehouse – and on the users' judgment of the data.

Title:

A MIDDLEWARE FOR THE MANAGEMENT OF LARGE UTILITIES PLANTS

Author(s):

andrea rossettini , salvatore cavalieri , carmelo floridia , fabrizio d'urso

Abstract: The paper presents the main features of the European project Mobicossum IST 1999-57455, still running. The project is a CRAFT one approved inside the Fifth Framework Programme. It aims to define a middleware offering services for the management of large plants, in the field of gas and water distribution and waste water treatment systems. In the paper, the main features of the project will be explained, focusing on the description of the implementation of the core of the middleware, called Generalised Interface.

Title:

ACQUIRING AND INTEGRATING EXTERNAL DATA INTO DATA WAREHOUSES

Author(s):

Mattias Strand , Benkt  Wangler , Carl-Fredrik Laurén

Abstract: Data warehouses (DWs) has become one of the major IT-investments during the last decades and in order to fully exploit the potential of data warehouses, more and more organizations are acquiring and integrating external data into their star-schemas. However, the literature covering external data acquisition and integration is limited. Therefore, in this paper the results of an interview study conducted among banking organizations are presented. The study aimed at identifying different approaches for acquiring and integrating external data into DWs. The results show that there are many different approaches for the acquisition and integration, depending on the purpose and structure of the data being acquired. In addition, the most common external data acquisition and integration process is presented and discussed.

Title:

A CONCEPTUAL FRAMEWORK FOR FORECASTING ERP IMPLEMENTATION SUCCESS - A FIRST STEP TOWARDS THE CREATION OF AN IMPLEMENTATION SUPPORT TOOL

Author(s):

Fredrik Carlsson , Andreas  Nilsson , Johan Magnusson

Abstract: The continuing soar in popularity when it comes to standardized information systems sold en masse under the labelling of Enterprise Resource Planning (ERP) Systems is somewhat kept under control by the ever flowing stream of reports from the industry of implementations gone bad. According to some researchers it is possible to assume that as many as 90% of all initiated ERP implementation projects can be regarded as failures as a result of changes in scope, prolongation of the project time or simply budget overruns. With the implementation of an ERP system being a very costly and risky endeavour, organizations considering “getting on the bandwagon” stand much to gain from pre-emptively forecasting the probability of success for an ERP implementation in their enterprise. Given this, the purpose of this paper is to investigate a possible conceptual framework for forecasting ERP implementation success and discuss the role of such a framework in a software based tool. This was achieved through an initial in-depth literary review aimed at finding factors affecting the outcome of the ERP implementation projects. These results were then communicated to an industrial support group comprised of possible ERP implementation stakeholders. After lengthy discussions concerning the usability, validity and reliability of the proposed list of factors, a conceptual framework was agreed upon for forecasting ERP implementation success. The framework was then tested against a number of possible stakeholders outside the industrial support group. As the results show we have been able to create a conceptual framework for forecasting ERP implementation success that is currently in the second wave of testing. The usability, validity and reliability of the framework is discussed and elaborated upon, and this paper concludes that the perceived usability and hence also value of the conceptual framework is substantial, whereas the validity and reliability remain to be tested.

Title:

VIRTUAL ORGANIZATIONS AND DATABASE ACCESS - A CASE STUDY

Author(s):

Marko NIINIMAKi , Mikko Pitkanen , John White , Tapio Niemi

Abstract: This paper presents a case study of using virtual organization technologies in database access. A virtual organization (VO) is a collection of people in the same administrative domain. A user can belong to many virtual organizations and have a different role (user, client, administrator,..) in each of them. An authorization of a user to different services within a VO is based on the user's identity and a service called a Virtual Organization Membership Service (VOMS) that maps these identities with roles. The user's identity can be established in two ways. If the user communicates with the service using his web browser, the user's certificate must be included in the browser. Another possibility is to use a proxy certificate. There, in the proxy creation process, the program that writes the proxy adds the user's proxy certificate information about his participation in different VO's and his role in each of them. In order to demonstrate using these VO proxy certificates, we have extended the functionality Spitfire, a relational database front end. This involves assigning the user a database role (read/write/update) based on the VO information in his certificate. There is also a GUI for creating the mappings between VO roles and database access roles.

Title:

REASONS FOR ERP ACQUISITION

Author(s):

Sami Sarpola , Sanna Laukkanen , Petri Hallikainen

Abstract: Numerous reasons for why organisations acquire Enterprise Resource Planning (ERP) systems have been proposed in prior research. In this paper we form a synthesis of these different reasons and categorize them into technological and business reasons for acquiring ERP. Further, we test the validity of these reasons with empirical data concerning the acquisition of ERP systems in 41 Finnish companies.

Title:

DELEGATING AUTHORITY IN A DISTRIBUTED INFORMATION MANAGEMENT SYSTEM

Author(s):

Janet Barnett , Barbara Vivier , Kareem Aggour

Abstract: The need to manage large information repositories securely in a distributed environment increases with the growth of the Internet. To address this need, a system capable of managing the contents of an LDAP directory over the Web has been designed and developed. This system allows for the directory’s data to be divided into communities and supports the delegation of administrative authority over those communities to a distributed set of administrators. The communities may be subdivided recursively into subgroups, and rights over those subgroups also may be restricted. Thus, system administrators can dynamically delegate subsets of their permissions over a subset of their managed data, allowing for the effective control of permissions over the data within distributed organizations. The system solves the delegated administration problem for managing the contents of an LDAP directory in a distributed environment. Today, it supports the administration of over 20 production directories by well over 2000 distributed administrators.

Title:

DISTRIBUTED DATABASE SYSTEM OF AGRICULTURAL SCIENCE AND TECHNOLOGY ALLIANCE OF UNIVERSITIES IN CHINA

Author(s):

Longyong You , Junjing Yuan , Jiayun Wang , Jian Zhang

Abstract: There are three problems that need to be solved by establishing the Distributed Database System of the information platform of Chinese universities' Agricultural Science and Technology Alliance: distribution of the data resources, decomposition and optimization of the distributed query as well as safety of the data system. In this paper, firstly, through the overall analysis of the contents of Chinese universities' Agricultural Science and Technology Alliance, we establish the mixed data distributed system, make the database system more integrated, consistent and reliable, meanwhile improve efficiency of the local application. Secondly because the member of the alliance adopts different data mode, taking a query decomposition and optimization for overall mode in the way of extended semi-join will be the effective method to improve the system response time. Finally, utilizing the method of combining asymmetry encryption with symmetry encryption, we solve the safety problems of database identity validation, data transmission, visitation control and etc.

Title:

A DATA WAREHOUSE ARCHITECTURE FOR BRAZILIAN SCIENCE AND TECHNOLOGY ENVIRONMENT

Author(s):

Andre Luís Menolli , Maria Madalena Dias

Abstract: Science and technology in Brazil are areas that have few available resources and many times these scarce resources are badly used. The data warehouse is a tool that can make possible a better distribution of these resources. In this article are considered some issues in the development of a data warehouse for Science & Technology management. The paper describes the necessity of a supporting system to the decision taking regarding the distribution of the resources destined to Science & Technology in Brazil, and also shows a data warehouse architecture that is being developed to support this system. Data Modeling characteristics defined for the proposed data warehouse architecture are presented too.

Title:

SOFTWARE PRODUCT LINE ANALYSIS OF ENTERPRISE INFORMATION SYSTEM

Author(s):

Luiz Fernando Capretz , Faheem Ahmed

Abstract: Now a days geographical and physical constraint that allowed only for fixed and static placements of resources has vanished completely within an enterprise utilizing the concept of information technology to integrate their business needs. The object oriented programming approach has paved a way to reusability of components thus reducing cost and development efforts up to certain extend. Software product line has further strengthened the concept of reusability, and component-based architecture. In this paper we have analyzed the concept of Software Product Line Analysis for an Enterprise Information System which will help to construct a Software Product Line within the organization to produce high quality software product in order to full fill the information technology requirements of the organization.

Title:

AN APS ARCHITECTURE FOR WEB SERVICES BASED ENTERPRISE INTEGRATION

Author(s):

William Liu , FengYu Wang , Tay Jin Chua

Abstract: Web Services enabling technology is widely used to address enterprise integration within company or cross-organizations due to its language and operating system independency and support of loosely coupled integration. This paper presents an architecture for APS (Advanced Planning and Scheduling) system by describing an APS request handling engine and web services based functions, attempting to solve integration issues among APS, MES, ERP and other manufacturing systems that could not be handled properly using current approaches. In addition, as manufacturing planning has been extended to cover entire supply chain, this paper also discusses the necessary changes of the proposed architecture to cater for the extension. That would be helpful to figure out capacity issue in a big picture

Title:

OBTAINING E-R DIAGRAMS SEMI-AUTOMATICALLY FROM NATURAL LANGUAGE SPECIFICATIONS

Author(s):

Farid Meziane

Abstract: Since their inception, entity relationship models have played a central role in systems specification, analysis and development. They have become an important part of several development methodologies and standards such as SSADM. Obtaining entity relationship models, can however, be a lengthy and time consuming task for all but the very smallest of specifications. This paper describes a semi-automatic approach for obtaining entity relationship models from natural language specifications. The approach begins by using natural language analysis techniques to translate sentences to a meaning representation language called logical form language. The logical forms of the sentences are used as a basis for identifying the entities and relationships. Heuristics are then used to suggest suitable degrees for the identified relationships. This paper describes and illustrates the main phases of the approach and presents a summary of the results obtained when it is applied to a case study.

Title:

TOWARDS CONCEPTUAL MEDIATION

Author(s):

Ismael Navas D. , José F. Aldana M.

Abstract: Mediators are usually developed as monolithic systems which envelope the data source’s semantics as well as its location. Furthermore, its architecture based on wrappers involves a high coupling degree among the mediator’s components. This coupling does not allow sharing services with other organizations or the dynamic integration of new data sources. Therefore, wrappers must be re-designed and manually added for each mediation system. We propose an architecture for conceptual mediation in which the sources’ query capabilities are published as web services. These services can be registered in one or more resource directories (Semantic Directories), which are the core of this architecture because they provide the needed flexibility and scalability for dynamic integration. Finally, we show an application in a bioinformatics context to validate our approach.

Title:

AN AUTOMATION SYSTEM BASED ON LABVIEW TO CONTROL THE TEST OF MECHANICAL FLOW METERS

Author(s):

Víctor Mejia , Javier  Martínez , Victor Silva , Ricardo Alvarez , Petronilo Cortez

Abstract: A mechanical flow meter is a device used mainly to measure and calculates velocity of weater´s flow on rivers and open channels. These devices, as the time of use pass trough, suffer mechanical imperfections, that's why it is important to calibrate them twice a year, depending of its time of use. At the Mexican Institute of Water Technology (IMTA in Spanish) was designed and developed a circular water tank for propose of test of these meters. The present paper shows the automation systems designed to control the tests to calibrate these mechanical meters. The system is based on LabVIEW. LabVIEW is a general purpose programming tool with extensive libraries for data acquisition instrument control, data analysis, and data presentation. With this tool and a special hardware interface, it was possible to automate the process to test these meters. The system is called SCM (System of characterization of mechanical meters). SCM control the test of two mechanical meters simultaneously, and has some user's control features that permit the Operator a easy to use human machine interface.

Title:

FUZZY MULTIPLE-LEVEL SEQUENTIAL PATTERNS DISCOVERY FROM CUSTOMER TRANSACTION DATABASES

Author(s):

Huilin Ye , An Chen

Abstract: Sequential patterns discovery is a very important research topic in data mining and knowledge discovery, and it has been widely applied in business analysis. Previous works were focused on mining sequential patterns at a single concept level based on definite and accurate concept which may not be concise and meaningful enough for human experts to easily obtain nontrivial knowledge from the rules discovered. In this paper, we introduce concept hierarchies firstly, and then discuss a mining algorithm F-MLSPDA for discovering multiple-level sequential patterns with quantitative attribute based on fuzzy partitions.

Title:

A METADATA REPOSITORY FOR IMAGE RETRIEVAL ALGORITHMS

Author(s):

Sahudy Montenegro González , Akebo Yamakami

Abstract: Many of the problems involved in image database applications require some form of retrieval based on image content. The explosion in availability of image content, due to recent developments in multimedia technology, demands the formulation of algorithms to facilitate content-based retrieval. Many image retrieval algorithms are implemented according to the needs of specific applications. Yet, there is currently no standard form of manipulation for these algorithms. This fact has a secure impact on the availability of algorithms beyond the bounds of the application for which they were originally designed. This work defines a general purpose repository for the algorithms involved in the process of Image Retrieval. The main goal of the repository is to provide the application developer with an infrastructure to manipulate and query image algorithms, allowing the integration of the image retrieval algorithms, the creation of a stock of algorithms available to multiple users, and to reuse/share algorithms for multiple applications. We define a standard set of metadata, applicable to image retrieval algorithms, providing uniform semantic support to understand these algorithms. This repository acts as a support to the development of image retrieval applications. The repository architecture is centered on providing distributed database functionality.

Title:

THE CONCEPT AND IMPLEMENTATION OF THE MARKET PLACE E-UTILITIES•COM

Author(s):

Jamil Dimassi , Carine Souveyet , Colette Rolland

Abstract: In order to remain competitive in a deregulated environment, a group of European Utilities developed a prototype of a single Marketplace called e-utilities•com whose mandate is a clear customer centric orientation in the European environment for a successful mid-term multi-utility business via the Web. This paper highlights the concept of e-utilities•com and its implementation in a Web portal.

Title:

PERFORMANCE INDICATORS: IMPORTANT TOOL FOR BUSINESS INTELLIGENCE AND INFORMATION SYSTEMS

Author(s):

María Luisa Sené

Abstract: In this paper is treated the importance of performance indicators in order to have a healthy organization. Also are given elements to understand why standardization is so related to this topic, and the most important thing, how all this contributes to design an information system that will help the organization in the process of decision-making. Are included examples of performance indicators that can be applied in any organization.

Title:

ACCESS MODEL IN COOPERATIVE INFORMATION SYSTEMS

Author(s):

Eric Disson , Danielle Boulanger

Abstract: This research focuses on access security in cooperating information systems. The offered modeling has to treat the interoperation of open and evolutive information systems and, moreover, has to guarantee the respect of various local security policies. The coexistence of heterogeneous information sources within an information systems framework involves homogenization problems between local security policies. We distinguish two types of heterogeneity: heterogeneity of the local access policies and semantic heterogeneity between object or subject instances of the local access schemas. To solve this twofold difficulty, we propose an original role model allowing a unified representation of local access schemas. This model preserves the flow control properties in the three main access policies (discretionary, role based model and multilevel models). The described access schemas are enriched to establish intra-system access authorizations.

Title:

BUSINESS MODELLING THROUGH ROADMAPS

Author(s):

Judith Barrios Albornoz , Jonás  Montilva Calderón

Abstract: Business modelling is a central activity to many different areas, including Business Process Reengineering, Organisational Development, Enterprise Modelling & Integration, Business Process Management and Enterprise Application Integration. It is well known that the business domain is not easy to understand neither to represent even for specialised people. The success of most of the contemporary methods for modelling Business Organisations or Enterprise Information Systems (EIS) is strongly associated with the level of understanding that the modelling team can attain about the specific situation being modelled. This understanding is directly related with the degree of modelling experience that the team has, as well as their ability to work with the techniques and tools prescribed by a specific method. Nowadays, most of the existing business modelling methods are concentrated in what are the business concepts and how to represent them. But, they lack of process guidance, which is needed to help the team through the modelling process. We elaborated the method BMM for modelling business application domains that provides working guidelines for the modelling team. This method, based on method engineering concepts helps teams to, not only, get a comprehensive knowledge about the business domain being modelled, but also, about the process of modelling the domain itself. This paper concerns with the representation of the process of modelling a business by using a decision oriented process model formalism. It is represented at a higher level by a roadmap. The main contribution of our work is a set of roadmaps that contains the knowledge associated with team member’s modelling experience in business modelling and EIS development. This knowledge arises from several case studies.

Title:

AUTOMATIC DISCOVERY OF SEMANTIC RELATIONSHIPS BETWEEN SCHEMA ELEMENTS

Author(s):

Nikos Rizopoulos

Abstract: The identification of semantic relationships between schema elements, or \schema matching, is the initial step in the integration of data sources. Existing approaches in automatic schema matching have mainly been concerned with discovering equivalence relationships between elements. In this paper, we present an approach to automatically discover richer and more expressive semantic relationships based on a bidirectional comparison of the elements data and metadata. The experiments that we have performed on real-world data sources from several domains show promising results, considering that we do not rely on any user or external knowledge.

Title:

MANAGING INFORMATION FLOW DYNAMICS WITH AGILE ENTERPRISE ARCHITECTURES

Author(s):

Drakoulis Martakos , Panagiotis  Kanellis , Nancy Alexopoulou

Abstract: New organization forms and ways of conducting business require architectures for enterprise systems that can support and not hinder entrepreneurial activities. Primarily this means that the information flow between both internal as well as cross-enterprise processes must be managed by underlying systems that offer a high level of automation as well as being highly flexible and integrated. In this respect, we present an agile architecture that offers a coherent and high level conceptualisation of the above properties that enterprise information systems should display, consider a number of technologies as potential implementation candidates and demonstrate how the architecture addresses node density, velocity, viscosity and volatility as parameters for managing and controlling the dynamics of information flows.

Title:

A TRANSACTIONAL MULTIMODE MODEL TO HANDLE OVERLOAD IN DISTRIBUTED RTDBSS

Author(s):

Samia Saad-Bouzefrane

Abstract: Current applications, such as Web-based services, electronic commerce, mobile telecommunication systems, etc. are distributed in nature and manipulate time-critical databases. In order to enhance the performance and the availability of such applications, the major issue is to develop efficient protocols that cooperate with the scheduler to manage the overload of the distributed system. In order to help real-time database management systems (RTDBS) to maintain data logical consistency while attempting to enhance concurrency execution of transactions, we introduce a transactional multimode model to let the application transactions adapt their behavior to the overload consequences. In this paper, we propose for each transaction several execution modes and we derive an overload controller suitable for the proposed multimode model.

Title:

A FRAMEWORK FOR EVALUATING DIFFICULTIES IN ERP IMPLEMENTATION

Author(s):

Jorge Marcelo Montagna , Luis Ferrario

Abstract: Various sources point out very high percentages of failures to implement ERP systems. In this work, the main difficulties for this task are analyzed and a systematic classification of fundamental reasons is intended. By considering the reasons that lead to failure, a simple and effective mechanism is generated to evaluate in advance complications the project might present. In this way, the tools to be used can be adjusted to the specific characteristics of the project. Somehow, it is intended to solve the problem presented by general methodologies, which are used for any kind of enterprise, without previously considering its conditions and state to face this type of projects.

Title:

STUDY OF DIFFERENT APPROACHES TO THE INTEGRATION OF SPATIAL XML WEB RESOURCES

Author(s):

Jose Corcoles , Pascual Gonzalez

Abstract: The research community has begun to investigate foundations for the next stage of the Web, called Semantic Web. Current efforts include the Extensible Markup Language XML, the Resource description Framework, Topic Maps and the DARPA Agent Markup Language DAML+OIL. A rich domain that requires special attention is the Geospatial Semantic Web. However, in order to approach the Geospatial Semantic Web, it is necessary to solve the problem of developing an integration system for querying spatial resources stored in different sources. In this paper, we study two different approaches to integrating spatial and non-spatial information represented in the Geographical Markup Language (GML). The approaches studied follow LAV (Local as View) integration. With this study we obtain the best approach to developing a real system for querying GML resources stored in different sources.

Title:

CAPABILITY-BASED QUERY PLANNING IN MEDIATOR SYSTEMS

Author(s):

Jiu Yang Tang

Abstract: This paper addresses the impact of capability description on query planning in heterogeneous data integration system. Query planning covers the selection of data sources related to the query and the determination of subgoals’ execution orders. In the context of capability description, we propose a framework for data sources description towards generating good feasible query plans. Our approach uses information such as the semantic correspondences between local schemas and mediated schemas and the query capability descriptions to investigate factors that provide a good foundation for query planning. Finally, the proposed approach is compared with the other capability description approaches described in the literature. The obtained results demonstrate that our approach will allow data sources to advertise their capabilities in a flexible way and help to efficiently query planning.

Title:

AN EFFICIENT B+-TREE IMPLEMENTATION IN C++ USING THE STL STYLE

Author(s):

Gregory Butler

Abstract: Database indexes are the search engines for database management systems. The B+-tree is one of the most widely used and studied data structuresand provides an efficient index structure for databases. An efficient implementation is crucial for a B+-tree index. Our B+-tree index is designed to be a container by following the style of the C++ Standard Template Library (STL) and implemented efficiently by using design patterns and generic programming techniques. Therefore, our B+-tree index can adapt to different key types, data types, different queries, and different database application domains, and be easy and convenient for developers to reuse just like other containers in the STL.

Title:

XRM: AN XML-BASED LANGUAGE FOR RULE MINING SYSTEMS

Author(s):

Dominique  Laurent , Tao-Yuan Jen , Ahmed Cheriat , Béatrice Bouchou , Mirian Halfeld-Ferrari

Abstract: In this paper, we present XRM, an XML-based language capable of promoting the collaboration among data mining systems. Indeed, KDD systems usually need a platform to integrate and exchange their results with different tools. XRM is a general framework to express any system results and/or data as logic formulas. In this way, XRM offers flexibility to represent data, constraints and patterns, and allows mining systems to present their results in an exchangeable format. In this work, we concentrate on the use of XRM to represent different forms of association rules. Association rule mining has evolved giving rise to sophisticate approaches that require interaction with other tools. XRM is built on XML Schema - in this way we can assure a certain level of correctness of data and mining results.

Title:

AUDIOVISUAL ARCHIVE WITH MPEG-7 VIDEO DESCRIPTION AND XML DATABASE

Author(s):

Pedro Almeida , Helder Troca Zagalo , Joaquim  Sousa Pinto , Joaquin Arnaldo Martins

Abstract: This article presents the work that has been developed in the creation of an audiovisual archive that uses the MPEG-7 standard to describe the video content and a XML database to store the video descriptions. It presents the model adopted to describe the video content, the framework of the audiovisual archive information system, a video indexing tool developed to allow the creation and manipulation of XML documents with the video descriptions and an interface to visualize the videos over the Web.

Title:

ENHANCING THE SUCCESS RATIO OF DISTRIBUTED REAL-TIME NESTED TRANSACTIONS

Author(s):

Majed Abdouli , Bruno Sadeg , Laurent Amanton

Abstract: The traditional transaction models are not suited to real-time database systems RTDBSs. Indeed, many current applications managed by these systems necessitate a kind of transactions where some of the ACID properties must be ignored or adapted. In this paper, we propose a real-time concurrency control protocol and an adaptation of the Two-Phase Commit Protocol based on the nested transaction model where a nested transaction is viewed as a collection of both essential and non-essential subtransactions: the essential subtransaction has a firm2 deadline, and the non-essential one has a soft3 deadline. We show through simulation results, how our protocol, based on this assumption, allows better concurrency between transactions and between subtransactions of the same transaction, enhancing then the success ration4 and the RTDBS performances, i.e.,more transaction may meet their deadline.

Title:

USING IUCLID FOR WORLDWIDE EXCHANGE OF CHEMICAL AND TOXICOLOGICAL INFORMATION

Author(s):

Stefan Scheer , Remi Allanou

Abstract: A database management tool (IUCLID) has been created in order to provide with administering chemical and toxicological data sent in structured form due to existing EU legislation. This tool also offers – beyond the normal dataset administration functionality – mechanisms for data fusion, data reproduction and data deployment. Thus IUCLID is used not only by who has to receive submissions of that kind but also who has to produce such submissions. Hence this product is used by whoever is involved as stakeholder in the current legislative process, and even beyond that it has been recognized successfully. Consequently it was the worldwide acceptance that helped in promoting this software product ahead of its original purpose and to establish a network of exchange.

Title:

RAPID XML DATABASE APPLICATION DEVELOPMENT

Author(s):

Kjetil Norvag , Albrecht Schmidt

Abstract: This paper proposes a rapid prototyping framework for XML database application development. By splitting up the development process into several refinement steps while keeping the application programming interface stable, the framework aims at rapid implementation of a prototype with a well-defined interface and a subsequent implementation of more advanced concepts like business rules in several steps. The refinement process takes the form of incrementally adding domain-specific information to the application. This is achieved by transgressing from general-purpose XML tools that do not support the definition and enforcement of constraints to frameworks that support domain-specific models and constraints such as E/R modeling. We have employed this method in the development of an example application, and we give performance numbers that illustrate the incremental improvements of each step.

Title:

ONTOLOGY-BASED REQUIREMENT ELICITATION

Author(s):

cong wang

Abstract: The key problem of information system development is how to acquire requirement. It has become the puzzled problem to the system developers for a long time. How to build a communication bridge between the developers and users has become a hot issue in requirement engineering. Ontology defines the common concepts and the relationships among them. A communication bridge can be built between the domain users and the system developers. Therefore, the ontology can direct the users and the developers to construct the requirement model. According to the different views of the system, this paper provides ontologies named business ontology, technique ontology and functionality Ontology for requirement elicitation. Firstly, this paper defines the concept of the ontology. Second, we describe the three ontologies in detail. Finally, through the ontologies, this paper provides the domain requirement model.

Title:

A TRANSACTION MODEL FOR LONG RUNNING BUSINESS PROCESSES

Author(s):

Jinling Wang , Beihong Jin , Jing Li

Abstract: Many business processes in the enterprise applications are both long running and transactional in nature, but currently no transaction model can provide full transaction support for such long running business processes. In this paper, we proposed a new transaction model — PP/T model. It can provide structural transaction support for the long running business processes, so that application developers can focus on the business logic, with the underlying platform providing the required transactional semantics. Simulation results show that the model has good performance in processing the long running business processes.

Title:

CACHING STRATEGIES FOR MOBILE DATABASES

Author(s):

Murilo de Camargo

Abstract: Caching remote data in local storage of a mobile client has been considered an effective solution to improve system performance for data management in mobile computing applications. In this paper, we propose a taxonomy for cache management in mobile database systems. The aim is to provide a unifying framework for the problem of caching in mobile computing, then a comparative review of the work done in this area up to now. Such a framework, with the associated analysis of the existing approaches, provides a basis for identifying strengths and weaknesses of individual methodologies, as well as general guidelines for future improvements and extensions.

Title:

DM-XIDS — AN APPLICATION-LAYER ROUTER OF INCOMING XML STREAMS

Author(s):

HAO GUI

Abstract: With the explosion of the information on the Internet and the widely use of the XML as a data exchange media, more and more information application can communicate with each other and deliver data of large volume in a continuous streaming. This trend has led to the emergence of novel concepts in data acquisition, integration, exchange, management and access. In this paper, we propose middleware architecture on XML streams information dissemination and design a prototype DM-XIDS as an applicable extension to our traditional database management system (named DM). Friendly graphical user interface is developed to efficiently generate and manage the diverse information subscriptions, which are described as queries in XPath. Effective algorithm is adopted to filter and match the ad hoc segment in the whole document. Automata-based query filtering mechanism will successfully implement the selection of data according to the queries in regular path expression that may include both nested path declaration and value predicate. Dedicated architecture is designed to accomplish our goals to dynamically direct the incoming XML data-stream from a static collection of information into a specific physically or logically distributed database environment. As a middleware of our database system, DM-XIDS presents a novel concept of an application-layer information router with additional administrative functions, which builds bridges between the XML stream source and the underlying data storage conforming to the pre-customized strategy.

Title:

AN APPROACH FOR SCHEMA EVOLUTION IN ODMG DATABASES

Author(s):

Cecilia Delgado Negrete

Abstract: Schema evolution is the process of applying changes to a schema in a consistent way and propagating these changes to the instances while the database is in operation. However, when a database is shared by many users, updates to the database schema are always difficult. To overcome this problem, in this paper we propose a version mechanism for schema evolution in ODMG databases that preserves old schemas for continued support of existing programs running on the shared database when schema changes are produced. Our approach uses external schema definition techniques and is based on the fact that if a schema change is requested on an external schema, rather than modifying the schema, a new schema, which reflects the semantics of the schema change, is defined.

Title:

COMPARISON OF APPROACHES IN DATA WAREHOUSE DEVELOPMENT IN FINANCIAL SERVICES AND HIGHER EDUCATION

Author(s):

Janis Benefelds , Laila Niedrite

Abstract: When a decision to develop a Data Warehouse is made, some sensitive factors should be evaluated to understand the tasks and prioritize them. Of course, priorities and conditions are unique in each Data Warehouse project development. In this paper we assume that there are common characteristics for companies of similar business activities and different for those with opposite activities. This article looks at the interpretation of the same criteria of two Data Warehouse projects in for-profit and not-for-profit areas. As representatives of for-profit and not-for-profit areas we selected financial services (banking) and higher education institutions. We have used the criteria from (List et al. 2002) to compare the results of the two projects. Each section of the paper describes this set of criteria for each of the two areas. The Data Warehouse development methodology used in each case is described too. An evaluation matrix is provided in Conclusion. The results shown there are not very different from Data Warehouse project development in an organization with respectively different behavior.

Title:

CORRELATING EVENTS FOR MONITORING BUSINESS PROCESSES

Author(s):

Josef Schiefer , Carolyn McGregor

Abstract: With the increasing demand for real-time information on critical performance indicators of business processes, the capturing, transformation and correlation of real-world events with minimal latency are a prerequisite for improving the speed and effectiveness of an organization's business operations. Events often include key business information about their relationship to other events that can be utilized to collect relevant event data for the calculation of business performance indicators. In this paper we introduce an approach for correlating events of business processes that uses correlation sessions to represent correlation knowledge. Correlation sessions facilitate the processing of data across multiple events and thereby enable a calculating of business metrics in near real-time. The benefit over existing approaches is that it is tailored to instrument business processes and business applications that may operate in a heterogeneous software environment. We propose a Java-based, container-managed environment which provides a distributed, scalable, near-real time processing of events and which includes a correlation service that effectively manages correlation sessions. We also show a complete example that illustrates how correlation sessions can be utilized for computing the cycle time of business processes.

Title:

TRANSFORMATION-ORIENTED MIDDLEWARE FOR LEGACY SYSTEM INTEGRATION

Author(s):

Urs Frei , Guido Menkhaus

Abstract: Most established companies have acquired legacy systems through mergers and acquisitions. The systems were developed independently of each other and very often they do not align with the evolving IT infrastructure. Still, they drive day-to-day business processes. Replacing the legacy application with new solutions might not be feasible, practical or cost a considerable amount of time. However, immediate integration might be a requirement for a strategic project, such as supply chain management or e-business. This article presents a transformation system for legacy system integration that allows flexible and effective transformation of data between heterogeneous systems. Sequences of transformations are described using a grammar based approach.

Title:

SCHEMA EVOLUTION FOR STARS AND SNOWFLAKES

Author(s):

Christian Kaas , Torben Bach  Pedersen , Bjørn  Rasmussen

Abstract: The most common implementation platform for multidimensional data warehouses is RDBMSs storing data in relational star and snowflake schemas. DW schemas evolve over time, which may invalidate existing analysis queries used for reporting purposes. However, the evolution properties of star and snowflake schemas have not previously been investigated systematically. This paper systematically investigates the evolution properties of star and snowflake schemas. Eight evolution operations are considered, covering insertion and deletion of dimensions, levels, dimension attributes, and measure attributes. For each operation, the formal semantics of the changes for star and snowflake schemas are given, and instance adaption and impact on existing queries are described. Finally, we compare the evolution properties of star and snowflake schemas, concluding that the star schema is considerably more robust towards schema changes than the snowflake schema.

Title:

AN EVENT PROCESSING SYSTEM FOR RULE-BASED COMPONENT INTEGRATION

Author(s):

Susan  Urban

Abstract: The IJK project has developed an environment in which active rules, known as integration rules, are used together with transactions to provide an event-driven, rule-based approach to the integration of black-box components. This paper presents the event processing system that supports the use of integration rules over components. The event processing system is composed of the language framework for the specification of different types of events, an event generation system for generating event instances, and an event handler for communicating the occurrence of events to the integration rule processor. The language framework supports the enhancement of EJB components with events that are generated before and after the execution of methods on components. Since integration rule support an immediate coupling mode and execute in the context of nested transactions, a synchronization algorithm has been developed to coordinate the execution of immediate integration rules with the execution of methods on components. The synchronization algorithm makes it possible to suspend and resume distributed application transactions to accommodate the nested execution of integration rules with an immediate coupling mode.

Title:

CONV2XML: RELATIONAL SCHEMA CONVERSION TO XML NESTED-BASED SCHEMA

Author(s):

Angela Duta , Ken Barker

Abstract: Conversion of relational data to XML is a critical topic in the database area. This approach translates the rigid tabular structures of relational databases into hierarchical XML structures. Logical connections between bits of data depicted by relationships are represented more naturally by tree-like structures. Conv2XML and ConvRel are two algorithms for converting relational schema to XML Schema focusing on preserving the source relationships and their structural constraints. ConvRel translates each relationship individually into a nested XML structure. Conv2XML identifies complex nested structures capable of modeling all relationships existent in a relational database.

Title:

APPLYING CROSS-TOPIC RELATIONSHIPS TO SEARCHING WITH INCREMENTAL RELEVANCE FEEDBACK

Author(s):

Stephen  Chan

Abstract: General purpose search engines such as Google and Yahoo define search topic hierarchies for document organization, yet such hierarchical structures cover only a portion of the possible relationships among search topics. It is believed that search effectiveness can be improved significantly by making better use of the semantic relations among search topics. In general, the is-child relation allows starting a search from general concepts, while the is-neighbor relation provides fresh information that can help users identify related search areas. This paper describes a topic network encompassing such relations, based on Bayesian networks techniques, to support searching, Our experiments show that making use of such a topic network can improve search effectiveness in a search engine using incremental feedback

Title:

INFORMATION INVASION IN ENTERPRISE SYSTEMS

Author(s):

Stephen Crouch , Peter Henderson , Robert Walters

Abstract: With the proliferation of internet-based technologies within and between organisations, large-scale enterprise systems are becoming more interconnected than ever before. A significant problem facing these organisations is how their information systems will cope with inconsistency being introduced from external data sources. Major problems arise when low quality information enters an authoritative enterprise system from these external sources, and in so doing gains credibility. This problem is compounded by the propagation of this information to other systems and other enterprises, potentially 'invading' an inter-enterprise network. In this paper we will introduce and examine this behaviour, which we term 'information invasion'. Characterisation of systems that are most vulnerable from such an occurrence is provided, and details of an experiment are given which simulates information invasion on an example network topology.

Title:

KNOWLEDGE TRANSFER TO AND AMONG END-USERS IN PRE-PACKAGED ENTERPRISE APPLICATION SOFTWARE IMPLEMENTATION: AN EXPLORATORY STUDY OF THE ROLES OF COMMUNITIES OF PRACTICE

Author(s):

Jimmy Tanamal

Abstract: This paper is concerned with the roles of Communities of Practice (CoPs) in knowledge transfer during the implementation of a particular IT artefact, i.e. the Pre-packaged Enterprise Application Software (PEAS) or also known as Enterprise Resource Planning (ERP) software. Using an in-depth longitudinal case-study across different stages of a Financial PEAS implementation in a large Australian university, we assess the effectiveness and applicability of the practices of CoPs for transferring the PEAS knowledge to and among end-users. The key finding of this paper is that CoPs can be utilized to enhance knowledge transfer for a better PEAS implementation result. Our findings also indicate that CoPs can be assigned to steward this dynamic PEAS knowledge in its most updated version among the very people who are its owners.

Title:

AN OBJECT ORIENTED APPROACH FOR DOCUMENT MANAGEMENT

Author(s):

Abdul Adamu , Souheil Khaddaj

Abstract: It is already widely accepted that the use of data abstraction in object oriented modelling enables real world objects to be well represented in information systems. In this work we are particularly interested with the use of object oriented techniques for document management. Object orientation is well suited for such systems, which require the ability to handle multiple types content. However, the matter of how to deal with the reuse and management of existing documents over time remains a major issue. This paper aims to investigate a conceptual model, based on object versioning techniques, that will represent the semantics in order to allow the continuity and pattern of changes of documents to be determined over time.

Title:

HEALTH CARE PROCESS BASED ON THE ABC MODEL THROUGH A META-STRUCTURED INFORMATION SYSTEM

Author(s):

Christine  VERDIER , Gérard CLUZE

Abstract: We propose in this article to define a system which generates a generic care process based on the ABC method. For this purpose, we adapt dynamically the medical information system with UML packages in order to generate some semantic and syntactic links between the different packages that represent the “business objects” of a hospital. These packages contain all the information related to a specific problem for all the patients. So we are able to extract the particular data concerning a criteria (diagnosis, IP number, etc.) and a patient and, in that manner, to re-build the care process. The ABC method gives the skeleton of the care process and allows the definition of costs on a particular care process (e.g. the care process of the patient “John” concerning the disease “kidney failure” in the hospital H).

Title:

A DATA WAREHOUSE FOR WEATHER INFORMATION

Author(s):

Jose Torres-Jimenez , José  Torres Jímenez

Abstract: Data warehouse related technologies, allows to extract, group and analyze historical data in order to identify information valuable to decision making processes. In this paper the implementation of a weather data warehouse (WDW) to store Mexico’s weather variables is presented. The weather variables data were provided by the Mexican Institute for Water Technologies (IMTA), the IMTA does research, development, adaptation, human resource formation and technology transfer to improve the Mexico’s water management, and in this way contribute to the sustainable development of Mexico. The implemented WDW contains two dimension tables (one time dimension table and, one geographical dimension table) and one fact table (that stores the data values for weather variables). The time dimension table spans over ten years from 1980 to 1990. The geographical dimension table involves many Mexico’s hydrological zones and comes from 5551 measuring stations. The WDW enables (through the dimensions navigation) the identification of weather patterns that would be useful for: a) agriculture politics definition; b) climatic change research; and c) contingency plans over weather extreme conditions. Even it is well known, but it is important to mention, that the data warehouse paradigm (in many cases) is better to derivate knowledge from the data in comparison to the database paradigm, a fact that was confirmed through the WDW exploitation

Title:

INTEGRATION, FLEXIBILITY AND TRANSVERSALITY: ESSENTIAL CHARACTERISTICS OF ERP SYSTEMS

Author(s):

Louis Raymond , Sylvestre Uwizeyemungu

Abstract: The interest of firms in ERP systems has been echoed in both the scientific and professional literature. It is worth noting however that while this literature has become increasingly abundant, there does not yet exist an operational definition of the ERP concept that is, if not unanimously, at least widely accepted. This constitutes a handicap for both the research and practice communities. The present study outlines what could be considered as an ERP by first determining the essentially required characteristics of such a system : integration, flexibility and transversality. Indicators are then provided in order to operationalise these three characteristics. The study concludes by proposing a research framework on the impact of an ERP’s key characteristics upon the performance of the system in a given organisational setting.

Title:

SEMANTIC INTEGRATION OF DISPARATE DATA SOURCES IN THE COG PROJECT

Author(s):

Jos de Bruijn

Abstract: We present a novel approach to the integration of structured information sources in enterprises, based on Semantic Web technology. The semantic information integration approach presented in this paper was applies in the COG project. We describe Unicorn's Semantic Information Management along with the Unicorn Workbench tool, which is a component part of the Unicorn System, and how they were applied in the project to solve the information integration problem. We used the Semantic Information Management Methodology and the Unicorn Workbench tool to create an Information Model (an ontology) based on data schemas taken from the automotive industry. We map these data schemas to the Information Model in order to make the meaning of the concepts in the data schemas explicit and relate them to each other, thereby creating an information architecture that provides a unified view of the data sources in the organization.

Title:

IMPROVING VIEW SELECTION IN QUERY REWRITING USING DOMAIN SEMANTICS

Author(s):

Qingyuan Bai , Michael F. McTear , Jun Hong

Abstract: Query rewriting using views is an important issue in data integration. Several algorithms have been proposed, such as the bucket algorithm, the inverse rules algorithm, the SVB algorithm, and the MiniCon algorithm. These algorithms can be divided into two categories. The algorithms of the first category are based on use of buckets while the ones of the second category are based on use of inverse rules. The bucket-based algorithms have not considered the effects of integrity constraints, such as domain semantics, functional and inclusion dependencies. As a result, they might miss query rewritings or generate redundant query rewritings in the presence of these constraints. A bucket-based algorithm consists of two steps. The first step is called view selection that selects views relevant to a given query and puts the views into the corresponding buckets. The second step is to generate all the possible query rewritings by combining a view from each bucket. In this paper, we consider an improvement of view selection in the bucket-based algorithms using domain semantics. We use the resolution method to generate a pseudo residue for each view given a set of domain semantics. Given a query, the pseudo residue of each view is compared with it and any conflict that exists can be found. As a result, irrelevant views can be removed even before a bucket-based algorithm is used.

Title:

THE ABORTION RATE OF LAZY REPLICATION PROTOCOLS FOR DISTRIBUTED DATABASES.

Author(s):

Luis  Irún-Briz

Abstract: Lazy update protocols have proven to have an undesirable behavior due to their high abortion rate in scenarios with high degree of access conflicts. In this paper, we present the problem of the abortion rate in such protocols from an statistical point of view, in order to provide an expression that predicts the probability of an object to be out of date during the execution of a transaction. It is also suggested a pseudo-optimistic technique that makes use of this expression to reduce the abortion rate caused by accesses to out of date objects. The proposal is validated by means of simulations of the behavior of the expression. Finally, the application of the presented results to improve lazy update protocols is discussed, providing a technique to theoretically determine the boundaries of the improvement.

Title:

NEW FAST ALGORITHM FOR INCREMENTAL MINING OF ASSOCIATION RULES

Author(s):

yasser El-Sonbaty , Rasha Kashef

Abstract: Mining association rules is a well-studied problem, and several algorithms were presented for finding large itemsets. In this paper we present a new algorithm for incremental discovery of large itemsets in an increasing set of transactions. The proposed algorithm is based on partitioning the database and keeping a summary of local large itemsets for each partition based on the concept of negative border technique. A global summary for the whole database is also created to facilitate the fast updating of overall large itemsets. When adding a new set of transactions to the database, the algorithm uses these summaries instead of scanning the whole database, thus reducing the number of database scans. The results of applying the new algorithm showed that the new technique is quite efficient, and in many respects superior to other incremental algorithms like Fast Update Algorithm (FUP) and Update Large Itemsets (ULI).

Title:

WISH QUERY COMPOSER

Author(s):

Gregory Butler

Abstract: The WISH (With Intuitive Search Help) Query Composer is a software tool for composing form-based queries and their associated reports for relational databases. It incorporates the SQL and XML industry standards to generate user-friendly customizable queries and reports. It uses the very simple but flexible XML semantics to represent database schemas, SQL queries and result datasets, regardless of in which relational database management system (RDBMS) the data is stored. The tool is developed in the Eclipse development environment using the Java programming language with Swing components, and connects to the database through Java Database Connectivity (JDBC). The Java Architecture for XML Binding (JAXB) is used to automate the mapping between XML documents and Java objects.

Title:

AN EXCHANGE SERVICE FOR FINANCIAL MARKETS

Author(s):

Fethi Rabhi , Feras Dabous , Hairong Yu

Abstract: The critical business requirements and compelling nature of the competitive landscape are pushing Information Technology systems away from the traditional centrally controlled corporate-wide architectures towards dynamic, loosely coupled, self-defining and service-based solutions. Web services are regarded as a key technology for addressing the need for connecting extended applications and providing standards and flexibility for enterprise legacy systems integration. This paper reports our experiences when integrating a financial market trading system. This integration process starts from analysing the trading system’s architecture, then identifying system functionality and finally realising the design and implementation of a Web service. Performance and security and the trade-offs involved are the major focus points throughout this process. Comprehensive benchmarking is conducted with and without Web service and security considerations.

Title:

DYNAMIC CHANGE OF SERVER ASSIGNMENTS IN DISTRIBUTED WORKFLOW MANAGEMENT SYSTEMS

Author(s):

Manfred Reichert

Abstract: Process-oriented application systems can only be realized -- with reasonable effort and at acceptable costs -- by the use of a workflow management system (WfMS). Central WfMS, with a single server controlling all workflow (WF) instances, however, may become overloaded very soon. In the WF literature, therefore, many approaches suggest using a multi-server WfMS with distributed WF control. In such a distributed WfMS, the concrete WF server for the control of a particular WF activity is usually defined by an associated server assignment. Following such an approach, problems may occur if components (WF servers, subnets, or gateways) become overloaded or break down. As we know from other fields of computer science, a favorable approach to handle such cases may be to dynamically change hardware assignment. This corresponds to the dynamic change of server assignments in WfMS. For the first time, this paper analyses to what extend this approach is reasonable in such situations.

Title:

A/D CASE: A NEW HEART FOR FD3

Author(s):

Manuel Enciso

Abstract: In [anonymous] we introduce the Functional Dependencies Data Dictionary (FD3) as an architecture to facilitate the integration of database Systems. We propose the use of logics based on the notion of Functional Dependencies (FD) to allows formal specification of the objects of a data model and to conceive future automated treatment. The existence of a FD logic provides a formal language suitable to carry out integration tasks and eases the design of an automatic integration process based in the axiomatic system of the FD logic. Besides that, FD3, provides a High Level Functional Dependencies (HLFD) Data Model which is used in a similar way as the Entity/Relationship Model. In this paper, we develop a CASE tool named A/D CASE (Attribute/Dependence CASE) that illustrates the practical benefits of the FD3 architecture. In the development of A/D CASE we have taken into account other theoretical results which improve our original FD3 proposal [anonymous]. Particularly: * A new functional dependencies logic named SLfd, for removing redundancy in a database sub-model that we present in [anonymous]. The use of SLfd add formalization to software engineering process. * An efficient preprocessing transformation based on the substitution paradigm that we present in [anonymous]. Unlike A/D CASE is independent from the Relational Model, it can be integrated into different database systems and it is compatible with relational DBMSs.

Title:

EFFICIENT QUERYING OF TRANSFORMED XML DOCUMENTS

Author(s):

Georg Birkenheuer , Stefan Böttcher , Sven Groppe

Abstract: An application using XML for data representation requires the transformation of XML data if the application accesses XML data of other applications, or of a global database using another XML format. The common approach transforms entire XML documents from one format into another e.g. by using an XSLT stylesheet. The application can then work locally on a copy of the original document transformed in the application-specific format. Different from the common approach, we use an XSLT stylesheet in order to transform a given XPath query such that we retrieve and transform only that part of the XML document which is sufficient to answer the given query. Among other things, our approach avoids problems of replication, saves processing time and in distributed scenarios, transportation costs. Experimental results of a prototype prove that our approach is scalable and efficient.

Title:

ATTENUATING THE EFFECT OF DATA ABNORMALITIES ON DATA WAREHOUSES

Author(s):

Orlando Belo , Anália Lourenço

Abstract: Today’s informational entanglement makes it crucial to enforce adequate management systems. Data warehousing systems appeared with the specific mission of providing adequate contents for data analysis, ensuring gathering, processing and maintenance of all data elements thought valuable. Data analysis in general, data mining and on-line analytical processing facilities, in particular, can achieve better, sharper results, because data quality is finally taken into account. The available elements must be submitted to an intensive processing before being able to integrate them into the data warehouse. Each data warehousing system embraces extraction, transformation and loading processes which are in charge of all the processing concerning the data preparation towards its integration into the data warehouse. Usually, data is scoped at several stages, inspecting data and schema issues and filtering all those elements that do not comply with the established rules. This paper proposes an agent-based platform, which not only ensures the traditional data flow, but also tries to recover the filtered data when an data error occurs. It is intended to perform the process of error monitoring and control automatically. Bad data is processed and eventually repaired by the agents, integrating it again into the data warehouse’s regular flow. All data processing efforts are registered and afterwards mined in order to establish data error patterns. The obtained results will enrich the wrappers knowledge about abnormal situations’ resolution. Eventually, this evolving will enhance the data warehouse population process, enlarging the integrated volume of data and enriching its actual quality and consistency.

Title:

A HYBRID APPROACH FOR EFFICIENT STORAGE AND RETRIEVAL OF MULTIDIMENSIONAL DATA

Author(s):

Jagdish K.T. , Srivani T.K.

Abstract: Mapping from multidimensional data to one-dimensional using Hilbert Index has been studied as a way of indexing for storage and retrieval of multidimensional data. There are mainly two approaches towards Storage and Retrieval of Multidimensional data (Jurgens, 2002) one is the Tree Based Approach and other is Bitmap Indexing. One main benefit of the tree-based approach over the bit map indexing is that they have superior storage property and the insert/update operations are efficient on the other hand the bitmap indexing provides for faster retrieval. Our data structure is mainly based on the tree-based approach in which every node of the tree contains a bit array. The presence of a bit array in every node provides for faster retrieval thereby giving the benefit of both the approaches. In this paper, we present a tree (HT-tree) based on Hilbert Curves for efficient data storage and retrieval of Multidimensional data. The HT-tree data search method mainly makes use of the bit representation of the Hilbert Index values to search for the data, instead of using conventional point search methods as used in most of the R-trees. The proposed data structure overcomes the disadvantages of the HG-tree namely, extra computation of minimum bounding rectangle from the range of Hilbert values required for point search, range search and nearest neighbour search and also the problems occurring from the overlap area and redundant searches.

Title:

RELATIONAL SAMPLING FOR DATA QUALITY AUDITING AND DECISION SUPPORT

Author(s):

José Nuno Oliveira , Bruno Cortes

Abstract: This paper presents a strategy for applying sampling techniques to relational databases, in the context of data quality auditing or decision support processes. Fuzzy cluster sampling is used to survey sets of records for correctness of business rules. Relational algebra estimators are presented as a data quality-auditing tool.

Title:

TURNING INFORMATION INTO ACTION: FROM DATA TO BUSINESS PROCESSES THROUGH WEB SERVICES

Author(s):

Youcef Baghdadi

Abstract: Sharing Web services across the enterprise and to support business-to-business integration becomes more and more intensive and critical for businesses. This paper proposes a process to generate Web services from the attributes of the business objects and coordination artifacts as described in the highest abstraction level of a business model i.e. the universe of discourse where the elements are unique and not duplicated. Indeed, the elements of the information system, technology-based representation of the universe of discourse, are complex and redundant. The process is based on the concept of factual dependency. The factual dependency is a mechanism that allows aggregations of the attributes that are concerned by the same CRUD operations with respect to the time and the space. Factual dependencies are then validated with respect to the possible business events to keep only the relevant ones. Each distinct and specified operation in terms of input/output parameters generates a lowest level of granularity Web service. These Web services are then registered to be discovered and (re)used at request by any business process.

Title:

LIFESTREAMS: BRAIN-FRIENDLY DATA ACCESS

Author(s):

Jussi Kangasharju , Tobias Limberger , Gerhard Austaller

Abstract: Modern databases are rapidly growing in size and complexity. However, many users do not have enough domain knowledge to formulate precise queries and are thus unable to use these databases to their full potential. In this paper we present our LifeStreams project which aims at a brain-friendly access to data using associations between documents. Associations in LifeStreams are based on examining similarities between documents in several metadata dimensions such as time, location, and keywords. We present a model for real world and abstract entities and discuss how the relationships between entities and documents can be established. We show how LifeStreams visualizes collections of documents using a 3-dimensional visualization technique. We also discuss real-world application scenarios for LifeStreams in a corporate environment.

Title:

AN METHOD BASED ON CHAOTIC AND FRACTAL CONTROL FOR SOFTWARE QUALITY - AN EXPERIENCE

Author(s):

ZHANG Kai

Abstract: Despite the fact that great efforts have been made, there still have been major software problems unsolved, such as overtime and low quality. The chaotic and fractal have become a focal research field recent years, but there are only two papers to study the software quality by chaos tool. The purpose of this paper is to explore an approach how to early control software quality by the chaotic and fractal tools. After the analysis for the growing process of the software defects, the authors believe that the software defect growth has chaotic fractal characteristic, and design a method based on the chaotic and fractal control for process management of software quality. Two experiments have testified to the control efficiency.

Title:

IMPROVING QUERY PERFORMANCE ON OLAP-DATA USING ENHANCED MULTIDIMENSIONAL INDICES

Author(s):

Yaokai Feng , Hiroshi  Ryu , Akifumi Makinouchi

Abstract: Multidimensional indices are efficient to improve the query performance on OLAP data. As one multidimensional index structure, R*-tree is popular and successful, which is a member of the famous R-tree family. We enhance the R*-tree to improve the performance of range queries on OLAP data. First, the following observations are presented. (1) The clustering pattern of the tuples (of the OLAP data) among the R*-tree leaf nodes is a decisive factor on range search performance and it is controllable. (2) There often exist many slender nodes when the R*-tree is used to index business data, which causes some problems both with the R*-tree construction and with queries. And then, we propose an approach to control the clustering pattern of tuples and propose an approach to solve the problems of slender nodes, where slender nodes refer to those having a very narrow side (even the side length is zero) in some dimension. Our proposals are examined by experiments using synthetic data and TPC-H benchmark data.

Title:

MANAGING WEB-BASED INFORMATION

Author(s):

Tullio  Vernazza , Giancarlo Succi , Alberto  Sillitti , Marco Scotto

Abstract: The heterogeneity and the lack of structure of World Wide Web make automated discovery, organization, and management of Web-based information a non-trivial task. Traditional search and indexing tools provide some comfort to users, but they generally provide neither structured information nor categorize, filter, or interpret documents in an automated way. In recent years, these factors have prompted the need for developing data mining techniques applied to the web, giving rise to the term “Web Mining”. This paper introduces the problem of web data extraction and gives a brief analysis of the various techniques to address it. Then, News Miner, a tool for Web Content Mining applied to the news retrieval is presented.

Title:

ADVANTAGES OF UML FOR MULTIDIMENSIONAL MODELING

Author(s):

Sergio Luján-Mora , Juan Trujillo , Panos Vassiliadis

Abstract: In the last few years, various approaches for the multidimensional (MD) modeling have been presented. However, none of them has been widely accepted as a standard. In this paper, we summarize the advantages of using object orientation for MD modeling. Furthermore, we use the UML, a standard visual modeling language, for modeling every aspect of MD systems. We show how our approach resolves elegantly some important problems of the MD modeling, such as multistar models, shared hierarchy levels, and heterogeneous dimensions. We believe that our approach, based on the popular UML, can be successfully used for MD modeling and can represent most of frequent MD modeling problems at the conceptual level.

Title:

SEMI-STRUCTURED INFORMATION WAREHOUSES: AN APPROACH TO A DOCUMENT MODEL TO SUPPORT THEIR CONSTRUCTION

Author(s):

Juan Manuel Pérez Martínez , Rafael Berlanga Llavori , Maria Jose Aramburu Cabo

Abstract: During the last decade, data warehouse and OLAP techniques have helped companies to gather, organize and analyze the structured data they produce. Simultaneously, digital libraries have applied Information Retrieval mechanisms to query their repositories of unstructured documents. In this context, the emergence of XML means the convergence of these two approaches, making possible the development of warehouses for semi-structured information. Although there exist several extensions of traditional data warehouse technology to manage semi-structured information, none of them are based on an underlying document model able to exploit this kind of information. Along this paper we present a set of requirements for semi-structured warehouses, as well as a document model to support their construction.

Title:

FACILITATING BUSINESS PROCESS MANAGEMENT WITH HARMONIZED MESSAGING

Author(s):

Shazia Sadiq , Maria Orlowska , Wasim Sadiq , Karsten Schulz

Abstract: Process communication is characterized by complex interactions between heterogeneous and autonomous systems within the enterprise and often between trading partners. A number of initiatives and proposals are underway to provide solutions for process specification and communication. However, the focus is often on defining APIs and interfaces rather than the semantics of the underlying message exchange. We see a great potential in the enhancement of current messaging infrastructure, in its new role in facilitating complex, long running interactions for dynamic and collaborative processes operating in decentralized environments like the World-Wide Web. In this paper, we primarily present a vision for a technology aimed at providing a level of business logic on the messaging layer, which we denominate as harmonisation of messages.. We will provide the conceptual framework for the harmonized messaging technology and identify fundamental issues for the specification of complex interactions.

Title:

MINING CLICKSTREAM-BASED DATA CUBES

Author(s):

Orlando Belo , Ronnie Alves

Abstract: Clickstream analysis can reveal usage patterns on company’s web sites giving highly improved understanding of customer behaviour, which can be used to improve customer satisfaction with the website and the company in general, yielding a great business advantage. Such summary information and rules have to be extracted from very large collections of clickstreams in web sites. This is challenging data mining, both in terms of the magnitude of data involved, and the need to incrementally adapt the mined patterns and rules as new data is collected. In this paper, we present some guidelines for implementing on-line analytical mining (OLAM) engines which means an integration of OLAP and mining techniques for exploring multidimensional data cube structures. In addition, we describe a data cube alternative for analyzing clickstreams. Besides, we discussed implementations that we consider efficient approaches on exploring multidimensional data cube structures, such as DBMiner, WebLobMiner, and OLAP-based Web Access Engine.

Title:

TRANSACTION CONCEPTS FOR SUPPORTING CHANGES IN DATA WAREHOUSES

Author(s):

Zbyszko Krolikowski , Robert Wrembel , Bartosz Bebel

Abstract: A data warehouse (DW) provides an information, from external data sources, for analytical processing, decision making, and data mining tools. External data sources are autonomous, i.e. they change over time, independently of a DW. Therefore, the structure and content of a DW has to be periodically synchronized with its external data sources. This synchronization concerns DW data as well as schema. Concurrent work of synchronizing processes and user queries may result in various anomalies. In order to tackle this problem we propose to apply a multiversion data warehouse and an advanced transaction mechanism to a DW synchronization.

Title:

AN ALTERNATIVE APPROACH FOR BUILDING WEB-APPLICATIONS

Author(s):

Oleg Rostanin

Abstract: Nowadays in J2EE-world there is a lot of blueprints, articles and books that propose some recommendations, recipes and patterns for producing web-applications in right way. There are also ready decisions like Jakarta Struts that can be taken as a base of a new project development. While developing the DaMiT e-learning system we tried to collect, analyse and implement many of the architectural features being proposed as well as to invent some new mechanisms such as supporting multiple kinds of client software or introducing XML-based interfaces between application tiers.

Title:

RJDBC: A SIMPLE DATABASE REPLICATION ENGINE

Author(s):

Javier Esparza Peidro

Abstract: Providing fault tolerant services is a key question among many services manufacturers. Thus, enterprises usually acquire complex and expensive replication engines. This paper offers an interesting choice to organizations which can not afford such costs. RJDBC stands for a simple, easy to install middleware, placed between the application and the database management system, intercepting all database operations and forwarding them among all the replicas of the system. However, from the point of view of the application, the database management system is accessed directly, so that RJDBC is able to supply replication capabilities in a transparent way. Such solution provides acceptable results in clustered configurations. This paper describes the architecture of the solution and some significant results.

Title:

TOWARDS DESIGN RATIONALES OF SOFTWARE CONFEDERATIONS

Author(s):

Michal Zemlicka

Abstract: The paper discuss reasons why service-oriented architecture is a new software paradigm and the consequences of this fact for the design of enterprise information systems. It is shown that such systems called confederations need not (should not) use web services in the sense of W3C which are more or less a necessity in e-commerce. As business processes supported by enterprise systems must be supervised by businessmen, the same must hold for ccommunication inside confederations. It implies that the interfaces of the services must be user-oriented (user-firendly). It has possitive consequences for the software engineering properties of the confederation. Confederations should sometimes include parts based on a difficult implementation philosophy (e.g. data orientation). Pros and cons of it are discussed. Open issues of service orientation are presented.

Title:

SOLVING INTEROPERABILITY PROBLEMS ON A FEDERATION OF SOFTWARE PROCESS SYSTEMS

Author(s):

Mohamed-Amine MOSTEFAI , Mohamed AHMED-NACER

Abstract: Software process components that share information and that cooperate for common tasks lead to multiple problems of interoperability for software process support environments based on a federation of heterogeneous and autonomous components. Some based-interoperability approaches have been proposed, especially at the conceptual level. However, more problems remain to be solved to enable the heterogeneous process components interoperability at execution level. This paper presents a process-based approach (architecture) for the federation of software process systems. Based on this federation architecture, we focuss on its implementation problems for the process execution interoperability. We show how we solve these problems and we discuss their implementation through the main development platforms of distributed applications.

Title:

VERSION MANAGEMENT FOR DATA WAREHOUSE EVOLUTION

Author(s):

Alexandre Schlottgen , Nina Edelweiss

Abstract: Various multidimensional data models were proposed in the last years for Data Warehouse (DW) modeling. However, there is a considerable shortage of models that deal with DW schema evolutions. In order to understand the DW life cycle and guarantee the correct and consistent maintenance of the populated data, it is necessary to control the modifications made at multidimensional schemata. This article studies the DW schema modification operations, presenting an extension to ME/R (Multidimensional Entity Relationship Model) to support the multiple versions management of DW schemata.

Title:

A RESPONSIBILITY-DRIVEN ARCHITECTURE FOR MOBILE ENTERPRISE APPLICATIONS

Author(s):

Qusay Mahmoud

Abstract: This paper deals with wireless applications that get downloaded, over the air, on handheld wireless devices and get executed there. Once running, they may need to interact with applications residing on remote wired servers. The motivation for this work is provided in part by the characteristics of the wireless computing environment. There are several implications of these characteristics that require a software architecture that reduces the load on the wireless link and supports disconnected operations. We present a responsibility-driven architecture that enables mobile thin-clients to interact with enterprise servers. We extend this architecture with mobile agent to reduce the load on the wireless link and support disconnected operations. This architecture is capable of supporting multiple devices with or without a client browser.

Title:

DESIGN AND REPRESENTATION OF THE TIME DIMENSION IN ENTERPRISE DATA WAREHOUSES - A BUSINESS RELATED PRACTICAL APPROACH

Author(s):

Ahmed Hezzah , A Min Tjoa

Abstract: A data warehouse provides a consistent view of business data over time. In order to do that data is represented in logical dimensions, with time being one of the most important dimensions. Representing time, however, is not always straightforward due to the complex nature of time issues and the strong dependence of the time dimension on the type of business. This paper addresses the specific issues encountered during the design of the time dimension for multidimensional data warehouses. It introduces design and modeling techniques for representing time in the data warehouse by the use of one or multiple time dimensions or database timestamps. It also discusses generic problems linked to the design and implementation of the time dimension which have to be considered for (global) business processes, such as representing holidays and fiscal periods, increasing the granularity of business facts, considering the observation of daylight saving time and handling different time zones. These problems seem to have wide application, and yet, more in-depth investigations need to be conducted in this field for real-world time-based analysis in enterprise-wide data warehouses.

Title:

A METHOD FOR XML DOCUMENT SCHEMA EVOLUTION

Author(s):

Lina Al-Jadir

Abstract: XML has become an emerging standard for data representation and data exchange on the Web. Although XML data is self-describing, most application domains tend to use document schemas. Over a period of time, these schemas need to be modified to reflect a change in the real-world, a change in the user’s requirements, mistakes or missing information in the initial design. Most of the current XML management systems do not support schema changes. In this paper, we propose a method to manage XML document schema evolution. We consider XML documents associated with DTDs. Our method consists in three steps. First, the DTD and XML documents are stored as a database schema and a database instance respectively. Second, DTD changes are applied as schema changes on the database. Third, the updated DTD and XML documents are retrieved from the database. Our method supports a complete set of DTD changes. The semantics of each DTD change is defined by preconditions and postactions, such that the new DTD is valid, existing XML documents conform to the new DTD, and data is not lost if possible. We implemented our method in an object-oriented database system.

Title:

PROPOSAL FOR AUTOMATING THE GENERATION PROCESS OF QUESTIONNAIRES TO MEASURE THE SATISFACTION LEVEL OF SOFTWARE USERS

Author(s):

María Inés Lund , Sergio Zapata , Mauro Paparo

Abstract: The most recent concepts on software quality take into account the factors of product quality, process quality and the satisfaction level of users. Therefore, when putting forth a plan for improving a software product, special attention should be paid as to incorporate the level of users’ satisfaction into the development premises. On this latter respect, well-designed surveys have proven to be a valuable tool to obtain and measure satisfaction variables. The survey-based strategies, however, present a drawback on the fact that the tasks involved in questionnaire generation are difficult to automate, which renders the entire approach almost impracticable. This work presents a proposal for automating the various stages defined in questionnaire generation, with the aim at making the measurement method be both applicable and more practical.

Title:

ONTOEDITOR: A WEB TOOL FOR MANIPULATING ONTOLOGIES STORED IN DATABASE SERVERS

Author(s):

Claudio de Souza Baptista , Karine Freitas Vasconcelos , Ulrich Schiel , Ladjane Silva Arruda , Elvis Rodrigues da Silva

Abstract: The Web is moving to a new generation in which machine-understandable processing is mandatory. In order to achieve this goal it is essential to define ontologies which enable the modeling of application domains and can be shared and understood by different applications in different platforms. These ontologies are complex and so it is necessary to provide software tools which aims to facilitate ontology manipulation. In this paper, we describe a new tool for ontology manipulation known as OntoEditor. OntoEditor is a Web tool, which has a graphical interface for representing an ontology graph. Moreover, OntoEditor uses a database management system for ontology persistency and query manipulation. The ontologies are represented internally as RDF and RDF Schema.

Title:

REFERENCIAL INTEGRITY MODEL FOR XML DATA INTEGRATED FROM HETEROGENEOUS DATABASES SYSTEMS

Author(s):

Mauri Ferrandin

Abstract: This article presents a proposal for maintenance of the referential integrity in data integrated from relational heterogeneous databases stored in XML materialised views. The core idea is the creation of a rules repository that will have to be observed to if carrying through any operation of update in the mediating layer of a system for integration of heterogeneous relational sources of data to guarantee that the updates carried through in the data stored in this layer can be propagated to the relational databases that are part of the system integrated without causing problem of referential integrity in the same ones. This proposal has as main objective to specify a mechanism capable to guarantee that the data after exported from the relational heterogeneous databases in a mediating layer, continue respecting the same integrity which these were submitted in the origin databases.

Title:

MODEL BASED MIDDLEWARE INTEGRATION

Author(s):

Frédérick Seyler

Abstract: In this paper, we describe a process and a meta model that we are defining for the reuse of legacy based systems. This aims at filing the gap between design level bridges and the implementation of interoperability. Our proposal is a component based integration process, a metamodel based on welle known component research results and a reuse architecture allowing an operational integration of legacy applications. The metamodel, called Ugatze is composed by a set of UML packages covering multiple Viewpoints of the reuse activity. Ugatze is the Basque name for the Bearded Vulture, it reuses bones of death animals to eat, and its re-integration in Basque Country seems to be difficult, but it is a challenge.

Title:

REAL-TIME DATABASES FOR SENSOR NETWORKS

Author(s):

Maria Lígia Barbosa Perkusich , Pedro Fernandes  Ribeiro Neto , Angelo Perkusich

Abstract: In the last years, the demand of embedded systems has been increased. Also, due to the increasing competition among different kind of companies, such as cellular phone, automobiles and industrial automation, the requirements for such systems are getting more complex. However, the data storage and processing techniques, for these environments, are insufficient for the new requirements. In this paper, we develop a model for the integration of real-time database technology with an embedded sensor network systems, to tackle such deficiencies.

Title:

MEMORY MANAGEMENT FOR LARGE SCALE DATA STREAM RECORDERS

Author(s):

Zimmermann Roger , Kun Fu

Abstract: Presently, digital continuous media (CM) are well established as an integral part of many applications. In recent years, a considerable amount of research has focused on the efficient retrieval of such media. Scant attention has been paid to servers that can record such streams in real time. However, more and more devices produce direct digital output streams. Hence, the need arises to capture and store these streams with an efficient data stream recorder that can handle both recording and playback of many streams simultaneously and provide a central repository for all data. In this report we investigate memory management in the context of large scale data stream recorders. We are especially interested in finding the minimal buffer space needed that still provides adequate resources with varying workloads. We show that computing the minimal memory is an NP-complete problem and will require further research to find efficient heuristics.

Title:

CONVERTING LEGACY RELATIONAL DATABASE INTO XML DATABASE THROUGH REVERSE ENGINEERING

Author(s):

Anthony Lo , Reda Alhajj , Ken Barker , Chunyan Wang

Abstract: XML (eXtensible Markup Language) has emerged and is being gradually accepted as the standard for data interchange over the Internet. Since most data is currently stored in relational database systems, the problem of converting relational data into XML assumes special significance. Many researchers have already done some accomplishments in this direction. They mainly focus on finding XML schema (e.g., DTD, XML-Schema, and RELAX) that best describes a given relational database with a corresponding well-defined database catalog that contains all information about tables, keys and constraints. However, not all existing databases can provide the required catalog information. Therefore, these applications do not work well for legacy relational database systems that were developed following the logical relational database design methodology, without being based on any commercial DBMS, and hence do~not provide well-defined metadata files describing the database structure and constraints. In this paper, we address this issue by first applying the reverse engineering approach to extract the ER (Extended Entity Relationship) model from a legacy relational database, then convert the ER to XML Schema. The proposed approach is capable of reflecting the relational schema flexibility into XML schema by considering the mapping of binary and nary relationships. We have implemented a first prototype and the initial experimental results are very encouraging, demonstrating the applicability and effectiveness of the proposed approach.

Title:

KEYS GRAPH - BASED RELATIONAL TO XML TRANSLATION ALGORITHM

Author(s):

Christine  VERDIER , Wilmondes MANZI DE ARANTES

Abstract: The authors propose two algorithms for generating a DTD and an XML document respectively from the metadata and the content of a relational database without any intermediary language or user intervention. Such algorithms always generate semantically correct XML output by respecting database functional dependencies represented in a graph structure they take as input. Finally, different XML representations (or views) meeting expectations of different kind of users can be obtained from the same data according to the database entity chosen as translation pivot

Title:

DURATIVE EVENTS IN ACTIVE DATABASES

Author(s):

Juan Carlos Augusto , Rodolfo Gomez

Abstract: Active databases are DBMS which are able to detect certain events in the environment and trigger actions in consequence. Event detection has been subject of much research, and a number of different event specification languages is extant. However, this is far from being a trivial or accomplished task. Most of these languages handle just instantaneous events, but it has been noticed that a number of situations arise where it would be interesting or even necessary to handle durative events. We elaborate on a given specification language which combines instantaneous and durative events, revealing some issues which must be taken into account when the semantics of event composition is defined.

Title:

EMULATIVE SOFTWARE ENGINEERING - AN EXPERIMENT AND EXPERIENCE

Author(s):

Xiong Qianxing , Zhang Zhang Kai

Abstract: Concurrent Engineering is a good method, but the method overstress the communication and cooperation of various departments in an enterprise so that it does not meet the requirement of fast tempo and direct confrontation of modern work. This paper proposes a so-called Emulative Software Engineering and reposes its hope in the method to solve the difficult problems of software quality and schedule control. The authors did a development experiment based on a teaching activity, which three groups joined. The experimental result and data show that the method is feasible or workable. It is reasonably believed that the method has the great and practical value for software development in spite of the fact that it originates from a new idea and software development of teaching activity. In addition, the results of the experiment indicate that Emulative Software Engineering has the weakness in information exchange, which needs to be rectified with the support from the strong points, close exchange, of both Concurrent Engineering and knowledge management.

Title:

COOPERATIVE LEGACY DATABASES - AN ONTOLOGY BASED CONTEXT MEDIATION

Author(s):

Philippe Thiran , Djamal  Benslimane

Abstract: Enterprise information systems contain collections of existing databases that must cooperate to carry out common tasks. Most often, these databases are legacy, autonomous and heterogeneous systems. In this paper, we focus on the semantic and dynamic aspects of legacy database interoperation. We present a context mediation approach to support legacy database interoperability, which is based on a conceptual level of database description and on a dynamic resolution of structural and semantic conflicts. An object oriented data model is described, which provides tools for a conceptually rich description of legacy database, and foundations for resolving semantic heterogeneities among systems.

Title:

ORGANIZATIONAL INFORMATION SYSTEMS DESIGN AND IMPLEMENTATION WITH CONTEXTUAL CONSTRAINT LOGIC PROGRAMMING

Author(s):

Salvador Abreu

Abstract: In this article we sustain that Contextual Constraint Logic Programming (CxCLP for short) is a useful paradigm in which to specify and implement Organizational Information Systems, particularly when integrated with the ISCO mediator framework. We briefly introduce the language and its underlying paradigm, appraising it from the angle of both of its ancestries: Logic and Object-Oriented Programming. An initial implementation has been developed and is being actively used in a real-world setting -- Universidade de �vora's Academic Information System. We briefly describe both the prototype implementation and its first large-scale application. We conclude that the risk taken in adopting a developing technology such as the one presented herein for a mission-critical system has paid off, in terms of both development ease and flexibility as well as in maintenance requirements.

Title:

WEB-BASED TRAINING SYSTEM FOR FOREST FIRE OFFICE STAFF

Author(s):

Juan Garbajosa

Abstract: The objective of this paper is to present an approach for a web-based training system for Forest fire offices. The development of a modelling and simulation technology for systems with a network-like architecture is a growing day by day. Forest fire offices represents an appropriate application to do this development. The approach described is based on an XML languages family defined in a research project and applied to a number of systems that have been modelled and simulated. This paper introduces two different points of views: the first the system architecture; the second the XML-based language and its use for simulation.

Title:

ARCO: MOVING DIGITAL LIBRARY STORAGE TO GRID COMPUTING

Author(s):

Paulo  Trezentos

Abstract: Storage has been extensively studied during the past few decades \cite{Fost97,Trez01}. However, the emerging trends on distributed computing bring new solutions for existent problems. Grid computing proposes a distributed approach for data storing. In this paper, we introduce a Grid-based system (ARCO) developed for multimedia storage of large ammounts of data. The system is being developed for Biblioteca Nacional, the National Library of Portugal. Using Grid informational system and resources management, we propose a transparent system where TeraBytes of data are stored in a beowulf cluster built of commodity components with backup solution and error recover mechanisms.~

Title:

DATA EXTRACTION AND TRANSFORMATION WITH FLAT FILE FOR BUSINESS INTEGRATION

Author(s):

Sheng Ye , Wei Sun , Zhong Tian

Abstract: Documents and their exchange play important roles in business operations and transactions. With the development of e-business, the capability of exchanging data in different formats is necessary for integrating heterogeneous enterprise applications. Though XML is becoming the standard communication protocol over the Internet, most enterprise applications today can only process a specific format text data, mostly in a flat file. These diverse data formats will continue to exist until the enterprises’ applications are upgraded to a version supporting XML. So the transformation between XML and flat file is widely demanded in business integration solutions. This paper introduces a round trip transformation technology between flat file and XML, Flat File Adapter. This technology employs a systematic and patent pending data extraction and formatting method to support the processing of complex format flat file. By using Flat File Adapter, developer can design the data transformation rules shortly, and these rules will be captured in a template that make it is easy to update for later requirements’ changes. In this paper, we introduce the system architecture, detailed components, and particular data extraction and transformation method. Finally, a sample application in B2B e-procurement solution is also described.

Title:

UNDERSTANDING THE ERP POST-IMPLEMENTATION DISCOURSE

Author(s):

Fergal Carton , Frederic Adam , David Sammon

Abstract: This paper presents the first stage of a larger research project focusing on understanding the emergence of ERP II. ERP is now being seen for what it really is: ‘a means to an end’, in that, its primary benefit is in the integrated infrastructure that it introduces and its ability to support future IS investments. The paper focuses on the changes that have been observed in the services offered by vendors and consultants in the now renamed ERP II market. Now terms like ‘ERP’ and ‘e-business’ are for the most part avoided by vendors and consultants as they are perceived to be out-of-date. For example, SAP once promoted that fact that they were ‘29 years in the business of e-business’ with ‘the best-run e-businesses run SAP’, but now their message promotes, ‘30 years in the business of helping businesses grow’ with ‘the best-run businesses run SAP’. In this paper, issues of concern with the realities of ERP post-implementation are presented through examining: benefits realisation; informational requirements; and generic to specific solutions. While we would argue that it is difficult to understand the rationale for the introduction of these ‘newer’ ERP extensions, we must acknowledge that a market has been created and that once again the ‘new-look’ ERP vendors are the dominant ERP II players. This leads us to question whether there is anything new in ERP II.

Title:

A FRAMEWORK FOR ON-DEMAND INTEGRATION OF ENTERPRISE DATA SOURCES

Author(s):

Tapio Niemi

Abstract: Deploying a data warehouse system in a company is usually an expensive and risky investement. Constructing a data warehouse is a large project that can take very long time. However, a company cannot know in advance exactly what benefits a data warehouse will offer, nor is it easy to predict kind of functionality it should be able to support to be usable in an event where company's processes or structures are changed. Due to these reasons, in many cases data warehousing projects have either been abandoned or been shown to be at least partial failures. We propose a new method by providing a platform to implement business intelligence systems on. The basic idea is to construct the analysis database (i.e. an OLAP cube) on demand and only include the data that is needed for the analysis at hand from the operational databases. In this way the data is always up-to-date, suitable for the current analysis, and some of the biggest risks associated with data warehouse systems can be avoided. The computational costs related to the cube construction are likely to remain at acceptable level, since only the relevant part of the data for the current analysis is needed from operational databases. Moreover, business intelligence systems, like OLAP, are traditionally limited to the data stored in the data warehouse of the company. In many cases this is not enough since the phenomenon under analysis can be dependent on something outside the scope of the company. For example, the oil price or the weather can have a remarkable effect on business. If a decision support system can not have access to this kind of external data, the analysis cannot find the right explanation for the problem. The proposed method enables the user performing the analysis to include external data to the OLAP cube. We outline the use of Grid techologies - a research field closely related to Internet computing - in the implementation to offer a cost-effective way to harness enough computing power used on parallel processing and sufficient security infrastructure (GSI). Another aspect of the Grid is that due to its potential to offer large amounts of storage capacity in a way that optimally leverages the advances in price/capacity ratio of new storage devices, it is possible to retrieve also archived transactional data in transparent manner. To deal with heterogenous data sources the XML language with XSL transformations is applied.

Title:

MEASURING THE IMPACT OF ENTERPRISE SYSTEMS ON BUSINESS OBJECTIVES

Author(s):

Vincent Owhoso , Donald Chand , James Hunton , Sri Vasudevan , George Hachey

Abstract: This is a research-in-progress report on our research project aimed at understanding how to assess the success of ERP systems. Based on an in-depth study of a successful ERP implementation in a multinational manufacturing and service organizations, we have identified sample performance indicators in all four dimensions of Balanced Scorecard. Our study shows that instead of building an ERp scorecard, it is more fruitful to study the impacts of the ERP systems on business goals and strategies.

Title:

INFOFLEX: FLEXIBLE AND DISTRIBUTED CONTENT MANAGEMENT. USING WEB SERVICES AND SEMANTIC WEB TO MANAGE CONTENT

Author(s):

Antonio Hernández Pérez , Tomás Nogales Flores , David Rodríguez Mateos , Luis Sánchez Fernández , Jesús Arias Fisteus , Norberto Fernández García , Jesús Villamor Lugo

Abstract: The development of information and communication technologies and the expansion of the Internet means that nowadays there are huge amounts of information available via these emergent media. The need to manage such information, which was in the past stored on paper media, has become apparent in different fields. A number of content management systems have appeared which aim to achieve this task. Most of these systems are oriented towards Web publishing on a central site, and they do not support collaboration among several, distributed sources of managed content. In this paper we present a proposal for an architecture for the efficient and flexible management of distributed.

Title:

INFORMATION SYSTEM FOR SUPPORTING THE INCLUSION OF JOB SEEKERS TO THE LABOUR MARKET.

Author(s):

Theodoros Alevizos , Christos Skourlas , Paraskevas Hadjidiakos

Abstract: In this paper, the interconnection and integration problem of disparate Information sources including multilingual information related to the Unemployed and Business is analyzed. A possible solution based on the use of the European curriculum vitae and the creation of Data Marts is briefly described. The approach is also influenced by well-known Cross-Lingual Information Retrieval (CLIR) techniques. We also focus on the creation of a pilot Information System for the Institute of Labour (INE) of the Greek General Confederation of Labour (GSEE). Eventually, our experience and a first evaluation of the system are discussed.

Title:

COMPONENT BASED INFORMATION SYSTEM RE-ENGINEERING APPROACH

Author(s):

Abdelaziz  KHADRAOUI , Michel Léonard

Abstract: This paper presents a concept called Component Based Information System Re-Engineering (CISRE), which lays down the foundation of a new re-engineering approach. CISRE covers all the facets of an Information System at three levels: system, collaboration and organization. The proposed approach of IS Re-engineering distinguishes two main phases: the comprehension phase and the renovation one which are not disjointed. The cognitive space of the comprehension phase permits the clarification of links between legal texts (general procedures) and the IS. The main goal is to converge into a new IS, within a rapid evolving environment. Therefore, the new IS will be achieved on stable concepts based on invariants.

Title:

DEPLOYING A SUPPLY CHAIN PORTAL TO TRANSFORM MILITARY OPERATIONS

Author(s):

Robert Sullivan , Sandor Boyson , Robert Stevens

Abstract: This short paper addresses the challenges and anticipated benefits of building and deploying a comprehensive end to end supply chain technology infrastructure for the U.S. Army, layering a portal, middleware, collaborative planning and forecasting applications and integrated ERP software in a rapid deployment process. As noted by Boyson and Corsi a supply chain portal can “harness diverse real time data sources to: -provide a unified format and middleware platform for legacy, enterprise and internet data -personalize views based on user requirements and access classifications -distribute field-based data gathered from scanners, PDA devices and other information appliances to multiple users in real time over the portal Thus, the portal provides a unifying structure allowing a single shared database to coordinate all the transactions within the organization as well as the transactions between the organization and its trading partners in real time”

Title:

DDING SPATIAL COMPONENTS TO SCIENTIFIC DATA WAREHOUSES

Author(s):

Kevin Deeb

Abstract: For many years universities and government agencies have been collecting a wealth of scientific data. It is now time to transform these data into information and make them readily available in a common format that is easily accessible, fast, and bridges the islands of information that have evolved at each site. The best architecture for this application is the data warehouse that protects the confidentiality of data before it can be published by principal investigators, preserves the privacy of contributors, provides sufficient granularity to enable scientists to variously manipulate data, supports robust metadata services, and contains a standardized spatial component. The benefits of the warehouse can be further enhanced by adding a spatial component so that the data can be brought to life, overlapping layers of information in a format that is easily grasped by management, enabling them to tease out trends in their areas of expertise.

Title:

DEVELOPING A CORPORATE INFORMATION SYSTEM ARCHITECTURE: THE CASE OF EUROSTAT

Author(s):

François Vernadat , Georges  Pongas

Abstract: The paper presents the vision being deployed at the Statistical Office of the European Communities (Eurostat) about a rationalised IT infrastructure for integrated operations of its various statistical production systems. The new architecture being implemented isolates physical data from applications and users, uses database federation mechanisms, strongly relies on the use of meta-data about storage systems, application systems and data life cycles, emphasises the use of thematic and support servers and will use a message-oriented middleware as its backbone for data exchange. Portal technology will provide the unique gateway both for internal and external users to have public or restricted access to information produced by over 130 statistical production systems working in the back-office. Architectural principles and solutions are discussed.


AREA 2 - Artificial Intelligence and Decision Support Systems
 

Title:

THE DATA FLOW AND DISTRIBUTED CALCULATIONS INTELLIGENCE INFORMATION TECHNOLOGY FOR DECISION SUPPORT SYSTEM IN REAL TIME

Author(s):

Michael Okhtilev

Abstract: The aim of this investigation is to develop unified models of complex technological process as controlled object states knowledge presentation; methods, algorithms and system of complex technological process states monitoring (situation assessment) programs automatic synthesis accord-ing to preset target and capability of verification and optimization con-sidered; special software prototype realizing controlled objects automatic monitoring.

Title:

STRATEGIC DMSS FOR E-BUSINESS PLANNING

Author(s):

Lidan Ha , Guisseppi  Forgionne , Fen Wang

Abstract: Strategic business planning is a critical decision problem determining the long-term survival and prosperity of companies especially in this E-era. The complex planning process can be facilitated through management science, economics, statistics, and other technological tools. However, managers are rarely aware of these tools, are not proficient in their use, or are incapable of acquiring the proficiency. Through theoretical exploration in previous phases of an ongoing project, we believed that such proficiency can be delivered through decision making support systems. The current study aims to develop and implement such a DMSS to deliver the specified e-business planning model and statistical methodologies, which can provide integrated and intelligent support for decision makers during the entire decision making process. A SAS-based approach was selected as the desired system development and implementation environment. It is the first time that theoretical implications from management science, marketing strategies and economic rules have been integrated in a strategic DMSS and implemented in a field setting.

Title:

UNSUPERVISED ARTIFICIAL NEURAL NETWORKS FOR CLUSTERING OF DOCUMENT COLLECTIONS

Author(s):

Ayad Fekry Ayad , Abdel-Badeeh Salem , Mostafa Syiam

Abstract: The Self-Organizing Map (SOM) has shown to be a stable neural network model for high- dimensional data analysis. However, its applicability is limited by the fact that some knowledge about the data is required to define the size of the network. In this paper the Growing Hierarchical SOM (GHSOM) is proposed. This dynamically growing architecture evolves into a hierarchical structure of self–organizing maps according to the characteristics of input data. Furthermore, each map is expanded until it represents the corresponding subset of the data at specific level. We demonstrate the benefits of this novel model using a real world example from the document-clustering domain. Comparison between both models (SOM & GHSOM) was held to explain the difference and investigate the benefits of using GHSOM.

Title:

MULTILAYER PERCEPTRONS TECHNIQUE IN CLASSIFYING STOCKS: A CASE STUDY OF EGYPTIAN STOCKS EXCHANGE

Author(s):

Medhat Abdelaal

Abstract: Classification rates on out-of-sample predictions can often be improved through the use of model selection when fitting a model on the training data. In this paper, the multilayer perceptron neural network using the back-propagation algorithm is studies for the classification of financial variables of the Egyptian Stock Exchange. The best network architecture is made up of eleven layers: five input layers, five hidden layers and the output layer. The sensitivity analysis has been investigated. Sensitivity analysis can give important insights into the usefulness of individual variables. It often identifies variables that can be safely ignored in subsequent analysis, and key variables that must always be retained. Also, the receiver operating characteristic curve is used to compare between classifiers, and can be used to select an optimum decision threshold to select the best network which can catch most of the variability of the data.

Title:

LINGUISTIC DESCRIPTION OF PATTERNS FROM MINED IMAGES

Author(s):

Hema Nair

Abstract: The objective of this paper is to propose an approach to describe patterns in remote-sensed images utilising fuzzy logic. The general form of a linguistically quantified proposition is “QY’s are F” where Q is a fuzzy linguistic quantifier, Y is a class of objects and F is a summary that applies to that class. The truth of such a proposition can be determined for each object characterised by a tuple in the database. Fuzzy descriptions of linguistic summaries help to evaluate the degree to which a summary describes an object or pattern in the image. A genetic algorithm technique is used to obtain optimal solutions that describe all the objects or patterns in the database. Image mining is used to extract unusual patterns from multi-dated satellite images of a geographic area.

Title:

ARTIFICIAL INTELLIGENCE REPRESENTATIONS OF MULTI-MODEL BASED CONTROLLERS

Author(s):

Manuel de la Sen , Asier Ibeas

Abstract: This paper develops a representation of multi-model based controllers by using artificial intelligence typical structures. These structures will be neural networks, genetic algorithms and fuzzy logic. The interpretation of multimodel controllers in an artificial intelligence frame will allow the application of each specific technique to the design of multimodel based controllers. A method for synthesizing multimodel based neural network controllers from already designed single model based ones is presented. Some applications of the genetic algorithms and fuzzy logic to multimodel controller design are proposed.

Title:

COMPREHENSIBLE CREDIT-SCORING KNOWLEDGE VISUALIZATION USING DECISION TABLES AND DIAGRAMS

Author(s):

Jan Vanthienen , Christophe Mues , Bart Baesens

Abstract: One of the key decision activities in financial institutions is to assess the credit-worthiness of an applicant for a loan, and thereupon decide whether or not to grant the loan. Many classification methods have been suggested in the credit-scoring literature to distinguish good payers from bad payers. Especially neural networks have received a lot of attention. However, a major drawback is their lack of transparency. While they can achieve a high predictive accuracy rate, the reasoning behind how they reach their decisions is not readily available, which hinders their acceptance by practitioners. Therefore, we have, in earlier work, proposed a two-step process to open the neural network black box which involves: (1) extracting rules from the network; (2) visualizing this rule set using an intuitive graphical representation. In this paper, we will focus on the second step and further investigate the use of two types of representations: decision tables and diagrams. The former are a well-known representation originally used as a programming technique. The latter are a generalization of decision trees taking on the form of a rooted, acyclic digraph instead of a tree, and have mainly been studied and applied by the hardware design community. We will compare both representations in terms of their ability to compactly represent the decision knowledge extracted from two real-life credit-scoring data sets.

Title:

A COMPARISON BETWEEN THE PROPORTIONAL KEEN APPROXIMATOR AND THE NEURAL NETWORKS LEARNING METHODS

Author(s):

Peyman Kabiri

Abstract: The Proportional Keen Approximation method is a young learning method using the linear approximation to learn hypothesis. In the paper this methodology will be compared with another well-established learning method i.e. the Artificial Neural Networks. The aim of this comparison is to learn about the strengths and the weaknesses of these learning methods regarding different properties of their learning process. The comparison is made using two different comparison methods. In the first method the algorithm and the known behavioural model of these methods are analysed. Later, using this analysis, these methods are compared. In the second approach, a reference dataset that contains some of the most problematic features in the learning process is selected. Using the selected dataset the differences between two learning methods are numerically analysed and a comparison is made.

Title:

AN AGENT -BASED KNOWLEDGE MANAGEMENT MODEL FOR ENABLING A STATISTICAL TESTING APPROACH TO DECISION SUPPORT E-COMMERCE

Author(s):

Faiz Al-Shrouf , Walter James

Abstract: This paper integrates decision support e-commerce applications, and the knowledge management domain with software agent technology. First, we give a brief overview of decision support systems. Then we present our terminology for the decision support e-commerce model and its components that utilize e-commerce application, agent-based knowledge management components, and a statistical testing model. We give a scenario for a multi-bidding e-commerce application and formulate a statistical testing model (Likelihood ratio test) based on a bivariate normal distribution. The aforementioned test model uses the power function to simulate results using four main agents namely an Information searching agent, a computing agent, a knowledge agent, and a decision support agent.

Title:

CONSOLIDATED TREE CONSTRUCTION ALGORITHM: STRUCTURALLY STEADY TREES

Author(s):

Olatz Arbelaitz Gallego , Jesús Maria Pérez de la Fuente , Javier Muguerza Rivero , Ibai Gurrutxaga Goikoetxea

Abstract: This paper presents a new methodology for building decision trees or classification trees (Consolidated Trees Construction algorithm) that faces up the problem of unsteadiness appearing in the paradigm when small variations in the training set happen. As a consequence, the understanding of the made classification is not lost, making this technique different from techniques such as bagging and boosting where the explanatory feature of the classification disappears. The presented methodology consists on a new meta-algorithm for building structurally more steady and less complex trees (consolidated trees), so that they maintain the explaining capacity and they are faster, but, without losing the discriminating capacity. The meta-algorithm uses C4.5 as base classifier. Besides the meta-algorithm, we propose a measure of the structural diversity used to analyse the stability of the structural component. This measure gives an estimation of the heterogeneity in a set of trees from the structural point of view. The obtained results have been compared with the ones get with C4.5 in some UCI Repository databases and a real application of customer fidelisation from a company of electrical appliances.

Title:

PROMAIS: A MULTI-AGENT MODEL FOR PRODUCTION INFORMATION SYSTEMS

Author(s):

Khaled  Ghédira , Lobna HSAIRI , Faiez  Gargouri

Abstract: In the age of information proliferation and communication advances, Cooperative Information System (CIS) technology becomes a vital factor for production system design in every modern enterprise. In fact, current production system must hold to new strategic, economic and organizational structures in order to face new challenges. Consequently, intelligent software based on agent technology emerges to improve system design on the one hand, and to increase production profitability and enterprise competitive position on the other hand. This paper starts with an analytical description of logical and physical flows dealt with manufacturing, then proposes a Production Multi-Agent Information System (ProMAIS). ProMAIS is a collection of stationary and intelligent agent-agencies with specialized expertises, interacting to carry out the shared objectives: cost-effective production in promised delay and adaptability to the changes. In order to bring ProMAIS’s dynamic aspect out, interaction protocols are specially zoomed out by cooperation, negotiation and Contract Net protocols.

Title:

COGNITIVE REASONING IN INTELLIGENT MEDICAL INFORMATION SYSTEMS

Author(s):

Marek Ogiela

Abstract: This paper presents new approach for cognitive reasoning in the field of artificial intelligence, used in medical information systems. These systems are applied in various tasks supporting decisions taken in the wide area of Medical Imaging. Such systems, in particular Decision Support Systems can be based on the methods of perceptual cognitive analysis of visual medical data and are directed at offering possibilities of automatic interpretation and semantic understanding of this type of data. The paper shall present a general application method of DSS in selected cases of CR and MRI image meaning interpretation showing the development of disease processes.

Title:

A HYBRID DECISION SUPPORT TOOL

Author(s):

Panayotis Pintelas , sotiris kotsiantis

Abstract: In decision support systems a classification problem can be easily solved by employing one of several methods such as different types of artificial neural networks, decision trees, bayesian classifiers, etc. Moreover, it may happen that certain parts of instances’ space are better predicting by one method than the others. Thus, the decision of which particular method to choose is a complicated problem. A good alternative to choosing only one method is to create a hybrid forecasting system incorporating a number of possible solution methods as components (an ensemble of classifiers). For this purpose, we have implemented a hybrid decision support system that combines a neural net, a decision tree and a bayesian algorithm using a stacking variant methodology. The presented system can be trained with any data, but in the current implementation is mainly used by tutors of Hellenic Open University to identify drop-out prone students. However, a comparison with other ensembles using the same classifiers as base learner on several standard benchmark data sets, showed that this tool gives better accuracy in most cases.

Title:

PROBLEMS RESOLUTION IN MATHNET SYSTEM

Author(s):

Sofiane Labidi , Hélder Borges

Abstract: We proposed and developed an component to problems resolution within MATHNET Project environment. This project is result of the integration of the Computer Assisted Learning and Cooperative Learning paradigms, and that implements a computer model to interective environment of Cooperative Teaching and Learning based on multiple artificial and human agents, placed on a computer net structure, making use of several multimidia resources. The MATHNET nucleus is made of small software packs that effectively implement the Cooperative Learning paradigm. Due to its total integration with the computer, the use of multimidia resource and net technologies offer new oportunities on the educational field, challenging the traditional pedagogical methods to benefit the learning process. On this paper, we present the structure and the architecture of an Resolving Problem Assistant, that have the following objectivies: a) will carry fixation or evaluation problems to the student resolve them, regarding the apprentice profile; b) help the learner when necessary on the problem’s resolution and c) passing on an opinion about a solution found by an apprentice after analysing it.

Title:

STRUCTURAL INERTIA OF VOTING SYSTEMS

Author(s):

Francesc Carreras

Abstract: Simple games reflect with more or less fidelity the strategic tensions inherent to voting systems. An interesting feature of these systems is their capability to act, i.e. their decisiveness. We introduce in this work a normalized measure of the inertia of any simple game from the strictly structural or normative viewpoint. Mathematical properties of this measure are presented, including axiomatic characterizations. The application to a comparative study of certain actual voting systems evidences striking differences as to the inertia degrees they show.

Title:

ANALYSIS OF THE ITERATED PROBABILISTIC WEIGHTED K NEAREST NEIGHBOR METHOD, A NEW DISTANCE-BASED ALGORITHM

Author(s):

José María Martínez-Otzeta

Abstract: The k-Nearest Neighbor (k-NN) classification method assigns to an unclassified point the class of the nearest of a set of previously classified points. A problem that arises when aplying this technique is that each labeled sample is given equal importance in deciding the class membership of the pattern to be classified, regardless of the typicalness of each neighbor. We report on the application of a new hybrid version named Iterated Probabilistic Weighted k Nearest Neighbor algorithm (IPW-k-NN) which classifies new cases based on the probability distribution each case has to belong to each class. These probabilities are computed for each case in the training database according to the k Nearest Neighbors it has in this database; this is a new way to measure the typicalness of a given case with regard to every class. Experiments have been carried out using UCI Machine Learning Repository well-known databases and performing 10-fold cross-validation to validate the results obtained in each of them. Three different distances (Euclidean, Camberra and Chebychev) are used in the comparison done.

Title:

APPLICATION OF NEURAL NETWORKS FOR PRIOR APPRAISAL OF STRUCTURAL FUNDS PROJECT PROPOSALS

Author(s):

Tadeusz A. Grzeszczyk

Abstract: The subject of present paper is to discuss the layout of conception referred to the use of artificial intelligence methods (neural networks) for prior appraisal of project proposals to be submitted by Polish enterprises to European Union in order to get financial assistance for investments from the EU structural funds and the state budget. The experiments are limited to prior appraisal of the projects submitted only, as their practical execution may begin not earlier than on the 1st May 2004 (enlargement of European Union). Author of the present paper discusses the method referred to appraisal of project proposals submitted by enterprises. The method is related to review and acceptance of expenditures for investments co-financed by European Regional Development Fund. The author formulates conception for implementation of appraisal principles which could be considered as element of review and acceptance of expenditures according to Commission Regulation 1685/2000.

Title:

OPTIMIZATION OF NEURAL NETWORK’S TRAINING SETS VIA CLUSTERING: APPLICATION IN SOLAR COLLECTOR REPRESENTATION

Author(s):

João Paulo Domingos Silva , Daniel Alencar Soares , Antônia Sônia Cardoso Diniz , Elizabeth Marques Duarte Pereira , Luis Enrique Zárate Gálvez , Renato Vimieiro

Abstract: Due the necessity of new ways of energy producing solar collector systems have been widely used around the world. The efficiency of this kind of systems is calculated through measurement of process parameters. There are mathematical models that represent these systems. However these models involve several parameters that may lead to nonlinear equations of the process. Artificial Neural Networks have been proposed in this work as an alternative of these models. However, a better modeling of the process by means of ANN depends on a representative training set. In order to better define the training set, the clustering technique called k-means has been used in this work.

Title:

ONTOLOGY-BASED FRAMEWORK FOR DOCUMENT INDEXING

Author(s):

Youssef Amghar , D. Bahloul , P. Maret

Abstract: The work presented in this paper addresses a project for the Computer Center CIRITL1. This company wants to save and capitalize its knowledge and its know-how concerning the production activities in the particular, the technical hitches relating to software applications encountered during the exploitation of these applications. Indeed with a well accessing documents base; actors will be able to better solve problems. Our purpose is to focus on ontology-based framework for indexing relevant documents. The domain ontology (OntoCIRTIL) has a structure which supports a semantic model based on semantic links and inference mechanisms. In this paper, we present a new model called S3 which, permits to model knowledge in upstream and index documents (or formalized knowledge) in downstream. To illustrate partial results, this model is then applied to OntoCIRTIL.

Title:

AN INTELLIGENT TUTORING SYSTEM FOR DATABASE TRANSACTION PROCESSING

Author(s):

Paul Douglas

Abstract: We describe an intelligent tutoring system that may be used to assist university-level students to learn key aspects of database transaction processing. The tutorial aid is based on a well defined theory of learning, and is implemented using PROLOG and Java. Some results of the evaluation of the learning tool are presented to demonstrate its effectiveness as a tutorial aid in an e-learning environment.

Title:

A DISTRIBUTED TRANSIENT INTER-PRODUCTION SCHEDULING FOR FLEXIBLE MANUFACTURING SYSTEMS

Author(s):

Pascal YIM , Olfa Belkahla , Khaled GHEDIRA , Ouajdi KORBAA

Abstract: This paper deals with the problem of cyclic scheduling for Flexible Manufacturing Systems (FMS) and presents a new Multi-Agent Model, composed of cooperating agents, for computing the Transient states between successive cyclic productions (called transient inter-productions). It aims to minimize the global makespan while reducing temporal complexity. The originality of the model relies on the use of Artificial Intelligence techniques, Multi- Agent Systems and Production management. Indeed, the planning phase allows, in the cyclic context, to determine the cyclic productions to respect the initial demand. These cyclic productions have to be sequenced each one in relation to the others. Once this operation done, the transient state allowing going from a cyclic state to the following one has to be determined and optimized.

Title:

AN XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION

Author(s):

Zeng Xingjie , Li Fang , Zhang Dongmo

Abstract: Extensible Markup Language (XML) has been widely used as a middleware because of its flexibility. Fixed domain is one of the bottlenecks of Information Extraction (IE) technologies. In this paper we present a XML-based domain-adaptable bootstrapping method of pattern acquisition, which focuses on minimizing the cost of domain migration. The approach starts from a seed corpus with some seed patterns; extends the corpus based on the seed corpus through the Internet and acquires the new patterns from extended corpus. Positive and negative examples been classified from training corpus are used to evaluate the patterns acquired. The result shows our method is a practical way in pattern acquisitions.

Title:

DYNAMIC MULTI-AGENT BASED VARIETY FORMATION AND STEERING IN MASS CUSTOMIZATION

Author(s):

Nizar Abdelkafi , Gerhard Friedrich , Gerold Kreutler , Thorsten Blecker

Abstract: Large product variety in mass customization involves a high internal complexity level inside a company’s operations, as well as a high external complexity level from a customer’s perspective. To cope with both complexity problems, an information system based on agent technology is able to be identified as a suitable solution approach. The mass customized products are assumed to be based on a modular architecture and each module variant is associated with an autonomous rational agent. Agents have to compete with each other in order to join coalitions representing salable product variants which suit real customers’ requirements. The negotiation process is based on a market mechanism supported by the target costing concept and a Dutch auction. Furthermore, in order to integrate the multi-agent system in the existing information system landscape of the mass customizer, a technical architecture is proposed and a scenario depicting the main communication steps is specified

Title:

USING MAS TO SOLVE PRODUCER CUSTOMER TRANSPORT PROBLEMS

Author(s):

Baltazar Frankovic , Tung Dang

Abstract: This paper deals with a problem of using multi-agent technology to simulate and resolve the planning problems. Concretely, multi-agent systems (MAS) are used in studying and resolving the optimization problems within the Producer-Customer-Transport (PCT) domain.

Title:

IMAGE CLASSIFICATION ACCORDING TO THE DOMINANT COLOUR

Author(s):

Amine Aït Younes , Isis Truck , Herman Akdag , Yannick Remion

Abstract: The aim of this work is to develop a user-friendly software allowing him to classify images according to their dominant colour expressed through linguistic expressions. With this aim in view, images are processed and stored in a database. The processing consists in assigning a profile to each image. To do this, we consider the pixels of the images in the colorimetric space HLS and then a restricted number of colours classes are built. These classes depend on the hue (H). For each colour class a certain number of subclasses depending on the lightness (L) and the Saturation (S) are defined. Finally the profile is drawn using the pixels membership of the classes and subclasses. Thus starting from a linguistic expression of a colour, the user can extract images from the database.

Title:

AN IMPLEMENTATION ENVIRONMENT OF KNOWLEDGE DISCOVERY SYSTEMS

Author(s):

Maria Dias , Roberto Pacheco

Abstract: After an organization having solved its operational problems, the need of systems appears for the support to the decision taking. Data mining is an area that is growing quickly to assist such new needs of the organization. However, the use of data mining techniques is uncommon by the difficult normally found in the development of knowledge discovery systems. This paper presents an environment of knowledge discovery in database, called ADesC. The main objective is to generate relevant information to decision taking, with the application of data mining techniques. This environment is based on agent technology to facilitate the performance of its tasks.

Title:

MULTI-AGENT APPROACH BASED ON TABU SEARCH FOR THE FLEXIBLE JOB SHOP SCHEDULING PROBLEM

Author(s):

Meriem Ennigrou , Khaled Ghédira

Abstract: This paper proposes a Multi-agent approach based on a tabu search method for solving the flexible Job Shop scheduling problem. The characteristic of the latter problem is that one or several machines can process one operation so that its processing time depends on the machine used. Such a generalization of the classical problem makes it more and more difficult to solve. The objective is to minimize the makespan or the total duration of the schedule. The proposed model is composed of three classes of agents: Job agents and Resource agents which are responsible for the satisfaction of the constraints under their jurisdiction, and an Interface agent containing the tabu search core. Different experimentations have been performed on different benchmarks and results have been presented.

Title:

MONTHLY FLOW ESTIMATION USING ELMAN NEURAL NETWORKS

Author(s):

Luiz Biondi Neto , João  Soares de Mello , Maria Fernandes Velloso , Lidia Angulo Meza , Pedro Gouvêa Coelho

Abstract: This paper investigates the application of partially recurrent artificial neural networks (ANN) in the flow estimation for São Francisco River that feeds the hydroelectric power plant of Sobradinho. An Elman neural network was used suitably arranged to receive samples of the flow time series data available for São Francisco River shifted by one month. For that, the neural network input had a delay loop that included several sets of inputs separated in periods of five years monthly shifted. The considered neural network had three hidden layers. There is a feedback between the output and the input of the first hidden layer that enables the neural network to present temporal capabilities useful in tracking time variations. The data used in the application concern to the measured São Francisco river flow time series from 1931 to 1996, in a total of 65 years from what 60 were used for training and 5 for testing. The obtained results indicate that the Elman neural network is suitable to estimate the river flow for 5 year periods monthly. The average estimation error was less than 0.2 %.

Title:

ISYDS - INTEGRATED SYSTEM FOR DECISION SUPPORT

Author(s):

Pedro Gouvêa Coelho , Eliane Gonçalves Gomes , João  Soares de Mello , Lidia Angulo Meza , Luiz Biondi Neto

Abstract: Data Envelopment Analysis is based on linear programming problems (LPP) to determine the efficiency of Decision Making Units (DMUs). This process can be computationally intense, as a LPP has to be run for each unit. Besides, a typical DEA LPP has a large number of redundant constraints concerning the inefficient DMUs. That results in degenerate LPPs and in some cases multiple efficient solutions. The developed work intends to to fill out a gap in current DEA softwares i.e. the lack of a software capable of producing full results in classic DEA models as well as the capability of using more advanced DEA models. The software interface as well as the models and solution algorithms were implemented in Delphi. Both basic and advanced DEA models are allowed in the software. Besides the main module that includes the DEA models, there is an additional module containing some models for decision support such as the multicriteria model called Analytic Hierarchic Process (AHP). The developed software was named as ISYDS – Integrated System for Decision Support. The software has been used in several theoretical and applied papers and has been very useful.

Title:

FACE PATTERN DETECTION

Author(s):

Adriano Moutinho , Antonio Carlos Thome , Luiz Biondi Neto , Pedro Henrique Golvea Coelho

Abstract: Security systems based on face recognition often have to deal with the problem of finding and segmenting the region of the face, containing nose, mouth and eyes, from the rest of the objects in the image. Finding the right position of a face is a part of any automatic identity recognition system, and it is, by itself, a very complex problem to solve, normally being handled separately. This paper describes an approach, using arti-ficial neural networks (ANN), to find the correct position and separate the face from the background. In order to accomplish this goal, a windowing method was created and combined with several image pre-processing steps, from histogram equalization to illumination correction, as an attempt to improve neural network recognition capability. This paper also proposes methods to segment facial features such as mouth, nose and eyes. Finally, the system is tested using 400 images and the performance of face and facial features segmentation is presented

Title:

RESULT COMPARISON OF TWO ROUGH SET BASED DISCRETIZATION ALGORITHMS

Author(s):

Shanchan Wu , Wenyuan Wang

Abstract: The area of knowledge discovery and data mining is growing rapidly. A large number of methods are employed to mine knowledge. Many of the methods rely of discrete data. However, most of the datasets used in real application have attributes with continuous values. To make the data mining techniques useful for such datasets, discretization is performed as a preprocessing step of the data mining. In this paper, we discuss rough set based discretization. We do experiments to compare the quality of Local discretization and Global discretization based on rough set. Our experiments show that Global discretization and Local discretization are dataset sensitive. Neither of them is always better than the other, though in some cases Global discretization generates far better results than Local discretization.

Title:

MANAGING ENGINEERING ASSETS: A KNOWLEDGE BASED ASSET MANAGEMENT METHODOLOGY THROUGH INFORMATION QUALITY

Author(s):

Abrar Haider

Abstract: As manufacturing organizations are becoming technology intensive, asset management is becoming crucial for profitability and efficiency of the business. Ensuring asset reliability, maintenance and management is profoundly dependent on knowledge based decision support backed by quality information. Multiplicity of data acquisition systems and techniques, together with the operation of assets in often unsettled and variable environments, makes it difficult to obtain quality information that could be used to make informed choices. Asset maintenance and reliability are important activities that can considerably influence an organisation’s ability to compete. This paper discusses the importance of data and information quality within asset management by analysing the intricacies of data quality and information flow within asset management systems and processes; and proposes frameworks for information quality and a model for an information driven, knowledge based asset management.

Title:

A DSS FOR ASSESSING TECHNOLOGY ENVIRONMENTS

Author(s):

Giovanni Camponovo , Yves Pigneur , Samuel Bendahan

Abstract: Assessing the external environment is an important component of organizations' survival and success. Unfortunately, a huge amount of information must be collected and processed in order to obtain a thorough and comprehensive representation of the environment. A decision support system can be very useful in helping decision makers to organize and analyze this information efficiently and effectively. This paper outlines a conceptual proposition helping to design such a system by presenting an ontology of the relevant information elements (actors, issues and needs) and a set of tools to analyze them. This paper also illustrates a prototype version of one of these tools which supports the analysis of the actors and issues perspectives.

Title:

WEB USAGE MINING WITH TIME CONSTRAINED ASSOCIATION RULES

Author(s):

Jan Vanthienen , Johan Huysmans , Bart Baesens

Abstract: Association rules are typically used to describe what items are frequently bought together. One could also use them in web usage mining to describe the pages that are often visited together. In this paper, we propose an extension to association rules by the introduction of timing constraints. Subsequently, the introduced concepts are used in an experiment to pre-process logfiles for web usage mining. We also describe how the method could be useful for market basket analysis and give an overview of related research. The paper is concluded by some suggestions for future research.

Title:

DOCTUS INTELLIGENT EXECUTIVE PORTAL FOR BUSINESS DECISIONS. USING HARD DATA AND SOFT KNOWLEDGE IN DOCTUS INTELLIGENT PORTAL

Author(s):

Zoltan Baracskai , Zoltan Nagy

Abstract: Business decision makers live in the avalanche of information, and the environment produces all kinds of surprises. However, companies should survive, that is what business decision makers struggle for. DoctuS, our knowledge based system; using case-based reasoning is integrated in an intelligent portal which links the soft knowledge of experts with the help of a company-wide knowledge map and its "zoom-ins". Also hard data can be derived ("mined") from data warehouses or integrated information systems. The portal supports business decision makers to take routine decisions easily, but also enables the contact to the Knowledge Factory, where new knowledge is created. This can assist business decision makers to make original decisions.

Title:

MODEL P : AN APPROACH OF THE ADAPTABILITY

Author(s):

Claude Petit , Mathilde Billy , François-Xavier Magaud

Abstract: This paper summarizes a new approach of the Cased-based Reasoning. The cases are not stored. The problem case solution is built as a puzzle. The puzzle obtained corresponds to the required solution. Each part is carrying information and has an associative behaviour. A piece seeks the piece which can be associated in width and in depth method. This associative behaviour is determined by several mechanisms: engine of expert system to binary rules, model of multicriterion choice of ordinal outclassing, search for close indices. A puzzle can thus have a complex mode of reasoning; each piece has a specific behaviour. The tool was tested on two applications of decision-making aid: identification of malaria facies and assistance to the specification of habitats.) These applications made it possible to check the interest of this original framework. In particular it brings an elegant solution to the phase of adaptation in CBR technique.

Title:

DATA MINING OF CRM KNOWLEDGE BASES FOR EFFECTIVE MARKET SEGMENTATION: A CONCEPTUAL FRAMEWORK

Author(s):

Jounghae  Bang , Nikhilesh Dholakia , Lutz Hamel , Ruby Roy Dholakia

Abstract: This paper illustrates the linkages between CRM systems, data mining techniques, and the strategic notions of market segmentation and relationship marketing. Using the hypothetical example of a consumer bank, the data in a relationship based marketing environment are illustrated and guidelines for knowledge discovery, data management and strategic marketing are developed.

Title:

A QUALITATIVE MODEL OF THE INDEBTEDNESS FOR THE SPANISH AUTONOMOUS REGIONS

Author(s):

Juan Moreno García , Luis Jimenez Linares , José Jesús Castro Sanchez , Victor Raúl López , José Baños

Abstract: This work shows a fuzzy model of the indebtedness for the Spanish autonomous regions that is obtained using approximate reasoning and induction methods. So, the algorithm ADRI is used to induce a linguistic model composed by a set of fuzzy rules. The quality of this linguistic model will be checked and its interpretation will be shown.

Title:

BDI AGENTS WITH FUZZY ASSOCIATIVE MEMORY FOR VESSEL BERTHING IN CONTAINER PORTS

Author(s):

Damminda Alahakoon , Parakrama Dissanayake , Prasanna  Lokuge

Abstract: Vessel scheduling in container terminals has to undergo various vague constraints of different issues and often use uncertain dynamically changing data. Faster turnaround time of the vessels in berths has direct impact on the improvement of terminals productivity. The need for an intelligent system that dynamically adapts to the changing environment is apparent, as there is limited number of berths and resources available in container terminals for delivering services to vessels. BDI (Beliefs, Desires and Intentions) agents are being proposed in a complex collaborative environment in the vessel scheduling assuring better management and control in the terminal. BDI agents to deal with many criteria and different goals with uncertain beliefs, it is proposed that fuzzy associative memory to use in the planning process of the BDI architecture facilitating better decision making in the whole process. In this paper we propose hybrid BDI architecture with fuzzy associative memory in handling uncertainty issues of the vessel berthing in container terminals. Execution of Plans in a collaborative multi agent environment would be strengthened with the introduction of fuzzy associative memory in BDI agents. Plans in the BDI agents are being constructed at different stages in order to achieve current desires. This would facilitate agents to observe dynamic changes in the environment and to be reflected in the next levels of planning.

Title:

DYNAMIC DIAGNOSIS OF ACTIVE SYSTEMS WITH FRAGMENTED OBSERVATIONS

Author(s):

Gianfranco Lamperti

Abstract: Diagnosis of discrete-event systems (DESs) is a complex and challenging task. Typical application domains include telecommunication networks, power networks, and digital-hardware networks. Recent blackouts in northern America and southern Europe offer evidence for the claim that automated diagnosis of large-scale DESs is a major requirement for the reliability of this sort of critical systems. The paper is meant as a little step toward this direction. A technique for the dynamic diagnosis of active systems with uncertain observations is presented. The essential contribution of the method lies in its ability to cope with uncertainty conditions while monitoring the systems, by generating diagnostic information at the occurrence of each newly-received fragment of observation. Uncertainty stems, on the one hand, from the complexity and distribution of the systems, where noise may affect the communication channels between the system and the control rooms, on the other, from the multiplicity of such channels, which is bound to relax the absolute temporal ordering of the observable events generated by the system during operation. The solution of these diagnostic problems requires nonmonotonic reasoning, where estimates of the system state and the relevant candidate diagnoses may not survive the occurrence of new observation fragments.

Title:

AN EFFICIENT FRAMEWORK FOR ITERATIVE TIME-SERIES TREND MINING

Author(s):

Ken Barker , Ajumobi  Udechukwu

Abstract: Trend analysis has applications in several domains including: stock market predictions, environmental trend analysis, sales analysis, etc. Temporal trend analysis is possible when the source data (either business or scientific) is collected with time stamps, or with time-related ordering. These time stamps (or orderings) are the core data points for time sequences, as they constitute time series or temporal data. Trends in these time series, when properly analyzed, lead to an understanding of the general behavior of the series so it is possible to more thoroughly understand dynamic behaviors found in data. This analysis provides a foundation for discovering pattern associations within the time series through mining. Furthermore, this foundation is necessary for the more insightful analysis that can only be achieved by comparing different time series found in the source data. Previous works on mining temporal trends attempt to efficiently discover patterns by optimizing discovery processes in a single pass over the data. Recent experience with data mining clearly indicates that the process is inherently iterative, with no guarantees that the best results are achieved in the first pass. Current iterative proposals introduce expensive re-computation after tuning the algorithm to address shortcomings discovered in the first heavy weight pass over the data. In fact, the same heavy weight process is then re-run on the data in the hope that new discoveries will be made on subsequent iterations. Unfortunately, this heavy weight re-execution and processing of the data is expensive. In this work we present a framework in which all the frequent trends in the time series are computed in a single pass, thus eliminating expensive re-computations in subsequent iterations. We also demonstrate that trend associations within the time series or with related time series can be found.

Title:

AUTOMATED PRODUCT RECOMMENDATION BY EMPLOYING CASE-BASED REASONING AGENTS

Author(s):

Reda Alhajj , Ozgur Baykal , Faruk Polat

Abstract: This paper proposes a cooperation framework for multiple role-based case-based reasoning (CBR) agents to handle the product recommendation problem for e-commerce applications. Each agent has different case structure with intersecting features and agents exploit all information related to the problem by cooperation, which is accomplished through the merge of distributed cases in order to form cases having better representation of the problem. The presented merge algorithm handles noisy distributed cases by negotiation on the difference values of the intersecting features. The role-based CBR agents merge the distributed cases by introducing a global heuristic function, which is used to evaluate the relevance of merged cases. The heuristic function exploits the relevancy of each merged case within the viewpoint of each agent and the satisfied/unsatisfied problem constraints. The viewpoint of an agent is represented by the value of consistency of distributed components of merged cases and agent’s individual relevance values of the merged cases. Finally, the proposed framework has been tested for elective course recommendation.

Title:

ASSESSMENT OF SPILLAGE OF LARGE-SCALE HYDROPOWER PLANT UNDERTAKING SPINNING RESERVE

Author(s):

Maihuan Zhao , Qiang Huang , Chenguang Xu

Abstract: Since the large-scale hydropower plant must undertake the spinning reserve of power system, a small amount of out-flow water does not generate electricity. In order to increase water use efficiency, it is necessary to calculate the spillage owing to improper dispatch. Therefore, the optimal operation of hydroelectric systems should be obligated with the undertaking spinning reserve. A calculation method of the spillage owing to improper dispatch for large-scale is discussed. And this method is used to calculate the spillage owing to improper dispatch of Longyangxia hydropower plant in 2001. The spillage owing to improper dispatch is remarkable, which could be saved by proper dispatch.

Title:

REDUCING REWORK IN THE DEVELOPMENT OF INFORMATION SYSTEMS THROUGH THE COMPONENTS OF DECISIONS

Author(s):

Bernadette Sharp , Andy Salter , Hanifa Shah

Abstract: The failure of information systems has been partially the result of incorrect or inefficient rework in the development of the systems. If greater transparency can be made in the decision making process then the number of examples of incorrect or inefficient rework could be reduced. Transparency in the process of development can be achieved through identifying and tracking the components of the decisions made during the development of the information system. This paper presents a theoretical framework for facilitating this tracking by comparing the components of the decisions in the development of the information system with those of an organisation and considering how the ‘needs’ of agents and the actions taken to fulfil those needs are related.

Title:

MINING SEQUENTIAL PATTERNS WITH REGULAR EXPRESSION CONSTRAINTS USING SEQUENTIAL PATTERN TREE

Author(s):

Mohamed Younis

Abstract: The significant growth of sequence database sizes in recent years increase the importance of developing new techniques for data organization and query processing. Discovering sequential patterns is an important problem in data mining with a host of application domains. For effectiveness and efficiency consideration, constraints are essential for many sequential applications. In this paper, we give a brief review of different sequential pattern mining algorithms, and then introduce a new algorithm (termed NewSPIRIT) for mining frequent sequential patterns that satisfy user specified regular expression constraints. The general idea of our algorithm is to use a finite state automata to represent the regular expression constraints and build a sequential pattern tree that represents all sequences of data which satisfy this constraints by scanning the database of sequences only once. Experimental results shows that our NewSPIRIT is much more efficient than existing algorithms.

Title:

WAREHOUSING AND MINING OF HIGHER EDUCATION DATA: USING EXISTING DATA TO MANAGE QUALITY

Author(s):

Pieter Conradie , Liezl  Van Dyk

Abstract: Data warehouses are constructed at higher education institutions (HEI) using data from transactional systems such as the student information system (SIS), the learning management system (LMS), the learning content management system (LCMS) as well as certain enterprise resource planning (ERP) modules. The most common HEI data mining applications are directed towards customer relationship management (CRM) and quality management. When students are viewed as material in manufacturing process, instead of the customer, different meaningful correlations, patterns and trends can be discovered which otherwise would have remained unexploited. As example statistical process control (SPC) – as data mining tool – is applied to student result data. This may eliminate the need to gather student-customer feedback for quality control purposes.

Title:

PREDICTING WEB REQUESTS EFFICIENTLY USING A PROBABILITY MODEL

Author(s):

Shanchan Wu , Wenyuan Wang

Abstract: As the world-wide-web grows rapidly and a user's browsing experiences are needed to be personalized, the problem of predicting a user's behavior on a web-site has become important. In this paper, we present a probability modal to utilize path profiles of users from web logs to predict the user's future requests. Each of the user's next probable requests is given a conditional probability value, which is calculated according to the function presented by us. Our modal can give several predictions ranked by the values of their probability instead of giving one, thus increasing recommending ability. Based on a compact tree structure, our algorithm is efficient. Our result can potentially be applied to a wide range of applications on the web, including pre-sending, pre-fetching, enhancement of recommendation systems as well as web caching policies. The experiments show that our modal has a good performance.

Title:

DATA MINING: PATTERN MINING AS A CLIQUE EXTRACTING TASK

Author(s):

Grete Lind , Rein Kuusik , Leo Võhandu

Abstract: One of the important tasks in solving data mining problems is finding frequent patterns in a given dataset. It allows to handle several tasks such as pattern mining, discovering association rules, clustering etc. There are several algorithms to solve this problem. In this paper we describe our task and results: a method for reordering a data matrix to give it a more informative form, problems of large datasets, (frequent) pattern finding task. Finally we show how to treat a data matrix as a graph, a pattern as a clique and pattern mining process as a clique extracting task. We present also a fast diclique extracting algorithm for pattern mining.

Title:

MULTIPLE ORGAN FAILURE DIAGNOSIS USING ADVERSE EVENTS AND NEURAL NETWORKS

Author(s):

Paulo Cortez

Abstract: In the past years, the Clinical Data Mining arena has suffered a remarkable development, where intelligent data analysis tools, such as Neural Networks, have been successfully applied in the design of medical systems. In this work, Neural Networks are applied to the prediction of organ dysfunction in Intensive Care Units. The novelty of this approach comes from the use of adverse events, which are triggered from four bedside alarms,being achived an overall predictive accuracy of 70%.

Title:

MINING SCIENTIFIC RESULTS THROUGH THE COMBINED USE OF CLUSTERING AND LINEAR PROGRAMMING TECHNIQUES

Author(s):

Sergio Greco , Andrea  Tagarelli , Irina Trubitsyna

Abstract: The paper proposes a technique based on a combined approach of data mining algorithms and linear programming methods for classifying organizational units, such as research centers. We exploit clustering algorithms for grouping information concerning the scientific activity of research centers. We also show that the replacement of an expensive efficiency measurement, based on the solution of linear programs, with a simple formula allows to efficiently compute clusters of very good quality. Some initial experimental results, obtained from the analysis of research centers in the agro-food sector, show the effectiveness of our approach, both from an efficiency and a quality-of-results viewpoint.

Title:

APPLICATION OF UNCERTAIN VARIABLES TO STABILITY ANALYSIS AND STABILIZATION FOR ABR ATM CONGESTION CONTROL SYSTEMS

Author(s):

Magdalena Turowska

Abstract: The paper presents the application of uncertain variables to stability analysis and stabilization of ABR ATM control systems. The unknown parameter is assumed to be a value of uncertain variable described by the certainty distribution given by a expert. The estimation of the certainty index that the congestion control system is stable is presented. A specific stabilization problem is considered.

Title:

HIERARCHICAL MODEL-BASED CLUSTERING FOR RELATIONAL DATA WITH AGGREGATES

Author(s):

Jianzhong CHEN , Sally McClean , Mary Shapcott , Kenny Adamson

Abstract: Clustering is a widely used technique in data mining to discover patterns in the underlying data. Most traditional clustering methods handle datasets that have single flat formats. Recently, there has been a growing interest in relational data mining, which deals with datasets containing multiple types of objects and richer relationships and are presented in relational formats, e.g. relational databases that have multiple tables. In this paper, we propose a hierarchical model-based method for clustering relational data by introducing frequency aggregates. We first define a relational data model that contains composite objects as an object-relational star schema, and present a method of integrating relational composite objects into flat aggregate objects through aggregation. In order to apply a hierarchical model-based clustering with the data, we define a new type of aggregates -- frequency aggregate, which has a vector data type and can be used to record not only the observed values but also the distribution of the values of a categorical attribute. A hierarchical agglomerative clustering algorithm with log-likelihood distance is then applied to cluster the aggregated data tentatively. After stopping at a coarse estimate of the number of clusters, a mixture model-based method with the EM algorithm is developed to perform a further relocation clustering, in which Bayes Information Criterion (BIC) is used to determine the optimal number of clusters. Finally we evaluate our approach on a real-world dataset.

Title:

BUILDING PROVEN CAUSAL MODEL BASES FOR STRATEGIC DECISION SUPPORT

Author(s):

Christian Hillbrand

Abstract: Since many Decision Support Systems (DSS) in the area of causal strategy planning methods incorporate techniques to draw conclusions from an underlying model but fail to prove the implicitly assumed hypotheses within the latter, this paper focuses on the improvement of the model base quality. Therefore, this approach employs Artificial Neural Networks (ANNs) to infer the underlying causal functions from empirical time series. As a prerequisite for this, an automated proof of causality for nomothetic cause-and-effect hypotheses has to be developed.

Title:

A SEMI-AUTOMATIC BAYESIAN ALGORITHM FOR ONTOLOGY LEARNING

Author(s):

Mario Vento , Massimo De Santo , Francesco Colace , Pasquale Foggia

Abstract: The entire world is living a transformation, perhaps the more important of last thirty years. The dissemination of the new technologies of the information is modifying radically the nature of the relationships between countries, markets, persons and culture. The technological revolution has favoured the process of globalization (Internet represents better than every other thing the global village) and the exchange of the information. Today the information can be considered an economic good whose value is closely connected the knowledge that can give. The dynamism of the new society forces the professional man to be abreast of technical progress. It is essential to introduce new didactic methodologies based on continuous long-life learning. A good solution can be E-learning. Although distance education environments are able to provide trainees and instructors with cooperative learning atmosphere, where students can share their experiences and teachers guide them in their learning, some problems must be still solved. One of the most important problem to solve is the correct definition of the domain of knowledge (i.e. ontology) related to the various courses. Often teachers are not able to easily formalize in correct way the reference ontology. On the other hand if we want realize some intelligent tutoring system that can help students and teachers during the learning process starting point is the ontology. In addition, the choice of best contents and information for students is closely connect to the ontology. In this paper, we propose a method for learning ontologies used to model a domain in the field of intelligent e-learning systems. This method is based on the use of the formalism of Bayesian networks for representing ontologies, as well as on the use of a learning algorithm that obtains the corresponding probabilistic model starting from the results of the evaluation tests associated with the didactic contents under examination. Finally, we will present an experimental evaluation of the method using data coming from real courses.

Title:

BAYESIAN NETWORK STRUCTURAL LEARNING FROM DATA: AN ALGORITHMS COMPARISON

Author(s):

Francesco Colace , Pasquale Foggia , Mario Vento , Massimo De Santo

Abstract: The manual determination of Bayesian Network structure or, more in general, of the probabilistic models, in particular in the case of remarkable dimensions domains, can be complex, time consuming and imprecise. Therefore, in the last years the interest of the scientific community in learning bayesian network structure from data is considerably increased. In fact, many techniques or disciplines, as data mining, text categorization, ontology description, can take advantages from this type of processes. In this paper we will describe some possible approaches to the structural learning of bayesian networks and introduce in detail some algorithms deriving from these ones. We will aim to compare results obtained using the main algorithms on databases normally used in literature. With this aim, we have selected and implemented five algorithms more used in literature. We will estimate the algorithms performances both considering the network topological reconstruction both the correct orientation of the obtained arcs.

Title:

MINING THE RELATIONSHIPS IN THE FORM OF THE PREDISPOSING FACTORS AND CO-INCIDENT FACTORS AMONG NUMERICAL DYNAMIC ATTRIBUTES IN TIME SERIES DATA SET BY USING THE COMBINATION OF SOME EXISTING TECHNIQUES

Author(s):

Suwimon  Kooptiwoot

Abstract: Temporal mining is a natural extension of data mining with added capabilities of discovering interesting patterns, inferring relationships of contextual and temporal proximity and may also lead to possible cause-effect associations. Temporal mining covers a wide range of paradigms for knowledge modeling and discovery. A common practice is to discover frequent sequences and patterns of a single variable. In this paper we present a new algorithm which is the combination of many existing ideas consists of the reference event as proposed in (Bettini, Wang et al. 1998), the event detection technique proposed in (Guralnik and Srivastava 1999), the large fraction proposed in (Mannila, Toivonen et al. 1997), the causal inference proposed in (Blum 1982) We use all of these ideas to build up our new algorithm for the discovery of multi-variable sequences in the form of the predisposing factor and co-incident factor of the reference event of interest. We define the event as positive direction of data change or negative direction of data change above a threshold value. From these patterns we infer predisposing and co-incident factors with respect to a reference variable. For this purpose we study the Open Source Software data collected from SourceForge website. Out of 240+ attributes we only consider thirteen time dependent attributes such as Page-views, Download, Bugs0, Bugs1, Support0, Support1, Patches0, Patches1, Tracker0, Tracker1, Tasks0, Tasks1 and CVS. These attributes indicate the degree and patterns of activities of projects through the course of their progress. The number of the Download is a good indication of the progress of the projects. So we use the Download as the reference attribute. We also test our algorithm with four synthetic data sets include noise up to 50 %. The results show that our algorithm can work well and tolerate to the noise data.

Title:

MINING THE RELATIONSHIPS IN THE FORM OF PREDISPOSING FACTOR AND CO-INCIDENT FACTOR IN TIME SERIES DATA SET BY USING THE COMBINATION OF SOME EXISTING IDEAS WITH A NEW IDEA FROM THE FACT IN THE CHEMICAL REACTION

Author(s):

Suwimon Kooptiwoot

Abstract: In this work we propose new algorithms from the combination of many existing ideas consisting of the reference event as proposed in (Bettini, Wang et al. 1998), the event detection technique proposed in (Guralnik and Srivastava 1999), the causal inference proposed in (Blum 1982; Blum 1982) and the new idea about the character of the catalyst seen in the chemical reaction. We use all of these ideas to build up our algorithms to mine the predisposing factor and co-incident factor of the reference event of interest. We apply our algorithms with OSS (Open Source Software) data set and show the result. We also test our algorithms with four synthetic data sets include noise up to 50 %. The results show that our algorithms can work well and tolerate to noise data.

Title:

THE DEVELOPMENT OF A KNOWLEDGE SYSTEM FOR ISO 9001 QUALITY MANAGEMENT

Author(s):

Hsun-Cheng HU , Sheng-Tun Li , Li-Yen Shue

Abstract: Many researchers in knowledge management point out that the first step toward knowledge management is the management of documents. However, the complexity imbedded in some documents could present great difficulty for most methodologies to deal with. The knowledge content for building an excellent quality management system that complies with ISO 9001 falls into this category; this knowledge is characterized by multi-dimensionality and knowledge embedment through various procedures and forms. We applied Ontology, which is a new approach in AI for better presenting knowledge structure of a domain, to develop a knowledge-based ISO 9001 quality management system for a Taiwanese chemical company that has to refer to a total of 175 ISO manuals. This system is built with Protégé 2000 as the knowledge platform, and we follow the development process recommended by Ontology Engineering of Toronto Virtual Enterprise. One main feature of the system is its capability of understanding the semantic of documents, which is a vital part of the inference mechanism in answering user’s queries.

Title:

AN EXPERIENCE WITH THE NEURAL NETWORK FOR AUTO-LANDING SYSTEM OF AN AIRCRAFT

Author(s):

sreenatha anavatti

Abstract: Abstract: Generalization by the Neural Networks is an added advantage that can provide very good robustness and disturbance rejection properties. By providing a sufficient number of training samples (inputs and their corresponding outputs), a network can deal with some inputs it has never seen before. This ability makes them very interesting for control applications because not only they can learn complicated control functions but they are able to respond to changing or unexpected environments. Aircraft landing system provides one such scenario wherein the flight conditions change quite dramatically over the path of descent. The present work discusses the training of a neural network to imitate a robust controller for auto-landing of an aircraft. The comparisons with the robust controller indicate the additional advantages of the neural network. The effects of disturbance and sensitivity analysis are presented to high light the generalization property of the neural network.

Title:

KNOWLEDGE MANAGEMENT AND ITS APPLICATION TO IMPROVE WORKFLOW

Author(s):

Tung Dang , Baltazar Frankovic

Abstract: This paper deals with one of many problems associated with building and developing a platform, based on the multi-agent technology for assisting office employees in their organization, and that is a problem of classification and identification of the right contacts. In order to assist newly arrived employees, agents search of the contacts used by previous employees and extract the one that is most appropriate for assisting the current activity. This paper presents methods for classification and selection of contacts based on the CBR technique and the forward search principle. The process of searching contacts is guided by user’s personal criteria. At the end, this paper discusses some possible techniques to solve user’s requirements, which cannot be achieved by using traditional search methods.

Title:

APPLYING DATA MINING TO SOFTWARE DEVELOPMENT PROJECTS: A CASE STUDY

Author(s):

Jacinto Mata Vázquez

Abstract: One of the main challenges that the project managers have during the building process of a software development project (SDP) is to optimise the values of the parameters that measure the viability of the final process. The accomplishment of this task, something that was not easy at the beginning, was helped with the appearance of dynamic models and simulation environments. The application of data mining techniques to the managing of Software Development Projects (SDP) is not an uncommon phenomenon, as in any other productive process that generates information in the way of input data and output variables. In this paper, we present and analyze the results obtained from a tool, developed by the authors, based on a Knowledge Discovery in Databases (KDD) technique. One of the most important contributions of these techniques to the software engineering field is the possibility of improving the management process of an SDP. The purpose is to provide accurate decision rules in order to help the project manager to take decisions during the development.

Title:

AN ADAPTABLE TIME-DELAY NEURAL NETWORK FOR PREDICT THE SPANISH ECONOMIC INDEBTEDNESS

Author(s):

Waldo Fajardo Contreras , Manuel Pegalar Cuellar , Mª Carmen Pegalajar Jimenez , Mª Angustias Navarro Ruiz , Ramón Pérez Pérez

Abstract: In this paper, we study and predict the indebtedness economic for the autonomic of Spain. In turn, we use model of neural network. In this study, we assess the feasibility of the Time-Delay neural network as an alternative to these classical forecasting models. This neural network permits accumulate more values of pass and to predict best the future. We show the assignment MSE to check the good forecasting of indebtedness economic.

Title:

A COMPARATIVE STUDY OF EVOLUTIONARY ALGORITHMS FOR TRAINING OF ELMAN RECURRENT NEURAL NETWORKS TO PREDICT THE AUTONOMOUS INDEBTNESS

Author(s):

M. Carmen Pegalajar , Manuel-Pegalajar Cuéllar

Abstract: In this paper we will show a training model for Elman Recurrent Neural Networks, based on Evolutionay Algorithms. It will be applied to Spanish Autonomous Indebtness Prediction. Applied Evolurionay Algorithms are Classic Genetic Algorithms, Multimodal Clearing algorithm and CHC algorithm. We will make a comparative study, training the net with each evolutionay algorithm to see the affectiveness of each training model to predict the Spanish Autonomous Indebtness.

Title:

DEVELOPMENT OF EXPERT SYSTEM FOR DETECTING INCIPIENT FAULTS IN TRANSFORMER BY USING DISSOLVED GAS ANALYSIS.

Author(s):

Nitin keshao Dhote -

Abstract: Power transformer is a vital component of power system, which has no substitute for its major role. They are quite expensive also. It is therefore, very important to closely monitor it’s in –service behavior to avoid costly outages and loss of production. Many devices have evolved to monitor the serviceability of power transformers. These devices such as Buchholz relay or differential relay respond only to a severe power failure requiring immediate removal of transformer from service, in which case, outages are inevitable. Thus, preventive techniques for early detection of faults to avoid outages would be valuable. A prototype of an expert system based on Dissolved Gas Analysis (DGA) technique for diagnosis of suspected transformers faults and their maintenance action are developed. The synthetic method is proposed to assist the popular gas ratio methods. This expert system is implemented into PC by using “Turbo Prolog” with rule based knowledge representations. The designed expert system has been tested for N.T.P.C., Talcher (India) transformer’s gas ratio records to show its effectiveness in transformer diagnosis.

Title:

PRACTICAL APPLICATION OF KDD TECHNIQUES TO AN INDUSTRIAL PROCESS

Author(s):

Victoria Pachón Álvarez

Abstract: In the process of smelting copper mineral a large amount of sulphuric dioxide (SO2) is produced. This compound would be highly pollutant if it was emitted to the atmosphere. By means of an acid plant it is possible to transform it into sulphuric acid, using for this a set of chemical and physical processes. In this way we dispose of a marketable product and, at the same time, the environment is protected. However, there are certain situations in which the gases escape to the atmosphere, creating pollutant situations. This would be avoidable if we exactly knew under which circumstances this problem is produced. In this paper we present a practical application of KDD techniques to the chemical industry. By means of the obtained results we show the viability of using automatic classifiers to improve a productive process, with an increase of the production and a decrease of the environmental pollution

Title:

DATABASES REDUCTION

Author(s):

Jesús S. Aguilar-Ruiz , Jose C. Riquelme , Roberto Ruiz Sánchez

Abstract: Progress in digital data acquisition and storage technology has resulted in the growth of huge databases. A great quantity of information. Nevertheless, these techniques often have high computational cost. Then, it is advisable to apply a preprocessing phase to reduce the complexity time. These preprocessing techniques are fundamentally oriented to either of the next goals: horizontal reduction of the databases or feature selection; and vertical reduction or editing. In this paper we present a new proposal to reduce databases applying sequentially vertical and horizontal reduction technics. They are based in our original works, and they use a projection concept as a method to choose examples and representative features. Results obtained are very satisfactory, because the reduced database offers the same knowledge with low added computational cost.

Title:

DATA MINING APPLICATION IN CLINICAL DATA OF PATIENTS WITH NEPHROLITHIASIS

Author(s):

Romero Paoliello , Paulo José Lage Alvarenga , Luis Enrique Zárate , Thiago Ribeiro

Abstract: Nephrolithiasis is a disease that is unknown yet a clinical treatment that determines its cure. In the adult population is esteemed an incidence around 5 to 12%, being a little lesser in the pediatric band. The renal colic, caused by nephrolithiasis, is the main disease symptom in the adults and it is observed in 14% of the pediatric patients. The disease symptoms in the pediatric patient don't follow a pattern, and this difficult the disease diagnosis. The main objective of this work is discovery the patters of the disease symptoms and identifies the apt population to acquire it. With this objective, is applied KDD methodology determining discriminant rules for the patterns of the symptoms, and with this, select the groups of patients with those sets of symptoms. The results and the conclusions of the work are presented in the end of the article.

Title:

QUALITY CONTROL USING FUZZY RULE BASED CLASSIFICATION SYSTEMS

Author(s):

Kumar Ujjwal ujjwal , Rajendra Sahu R.sahu , Rajendra Sahu R.sahu

Abstract: In recent years, Total Quality Management (TQM) has captured the worldwide attention and is being adopted in many organization both profit and non-profit. The aim of this paper is to generate those rules from the existing data that affect the quality of the product and use the generated rules to construct a Fuzzy Inference System (FIS) which can be used for product classification under the categories of Good, Average and Poor. The rules will incorporate all the important attributes that affect a particular product. This paper uses the concept of Fuzzy Inference Systems (FIS) which are widely used for process simulation or control. They can be designed either from expert knowledge or from data. For complex systems, FIS based on expert knowledge only may suffer from a loss of accuracy. This is the main incentive for using fuzzy rules inferred from data. In the synthesis of a fuzzy system from data two steps are generally employed: automatic rule generation and system optimization. This paper analyzes the grid partitioning approach of extracting rules from data and then it focuses on how the rules can be optimized and how the developed rules can be used for product classification on the basis of their quality.

Title:

OBJECTMINER: A NEW APPROACH FOR MINING COMPLEX OBJECTS

Author(s):

Rafael Berlanga , Roxana Danger , José Ruíz-Shulcloper

Abstract: Since their introduction in 1993, association rules have been successfully applied to the description and summarization of discovered relations between attributes in a large collection of objects. However, most of the research works in this area have focused on mining simple objects, usually represented as a set of binary variables. The proposed work presents a framework for mining complex objects, whose attributes can be of any data type (single and multi-valued). The mining process is guided by the semantics associated to each object feature, which is stated by users by providing both a comparison criterion and a similarity function over the object subdescriptions. Experimental results show the usefulness of the proposal.

Title:

INFORMATION ACCESS VIA TOPIC HIERARCHIES AND THEMATIC ANNOTATIONS FROM DOCUMENT COLLECTIONS

Author(s):

Hermine Njike Fotzo

Abstract: With the development and the availability of large textual corpora, there is a need for enriching and organizing these corpora so as to make easier the research and navigation among the documents. The Semantic Web research focuses on augmenting ordinary Web pages with semantics. Indeed, wealth of information exists today in electronic form, they cannot be easily processed by computers due to lack of external semantics. Furthermore, the semantic addition is an help for user to locate, process information and compare documents contents. For now, Semantic Web research has been focused on the standardization, internal structuring of pages, and sharing of ontologies in a variety of domains. Concerning external structuring, hypertext and information retrieval communities propose to indicate relations between documents via hyperlinks or by organizing documents into concepts hierarchies, both being manually developed. We consider here the problem of automatically structuring and organizing corpora in a way that reflects semantic relations between documents. We propose an algorithm for automatically inferring concepts hierarchies from a corpus. We then show how this method may be used to create specialization/generalization links between documents leading to document hierarchies. As a byproduct, documents are annotated with keywords giving the main concepts present in the documents. We also introduce numerical criteria for measuring the relevance of the automatically generated hierarchies and describe some experiments performed on data from the LookSmart and New Scientist web sites.

Title:

LEARNING BAYESIAN NETWORKS WITH LARGEST CHAIN GRAPHS

Author(s):

Mohamed BENDOU , Paul MUNTEANU

Abstract: This paper proposes a new approach for designing learning bayesian network algorithms that explore the structure equivalence classes space. Its main originality consists in the representation of equivalence classes by largest chain graphs, instead of essential graphs which are generally used in the similar task. We show that this approach drastically simplifies the algorithms formulation and has some beneficial aspects on their execution time.

Title:

MODEL-BASED COLLABORATIVE FILTERING FOR TEAM BUILDING SUPPORT

Author(s):

Alípio Jorge , Miguel Veloso , Paulo Azevedo

Abstract: In this paper we describe an application of recommender systems to team building in a company or organization. The recommender system uses a collaborative filtering model based approach. Recommender models are sets of association rules extracted from the activity log of employees assigned to projects or tasks. Recommendation is performed at two levels: first by recommending a single team element given a partially built team; and second by recommending changes to a complete team. The methodology is applied to a case study with real data. The results are evaluated through experimental tests and a users’ perception survey.

Title:

NEW ENERGETIC SELECTION PRINCIPLE IN DIFFERENTIAL EVOLUTION

Author(s):

Vitaliy Feoktistov

Abstract: The Differential Evolution (DE) algorithm goes back to the class of Evolutionary Algorithms and inherits its philosophy and concept. Possessing only three control parameters (size of population, differentiation and recombination constants) DE has promising characteristics of robustness and convergence. In this paper we introduce a new principle of Energetic Selection. It consists in both decreasing the population size and the computation efforts according to an energetic barrier function which depends on the number of generation. The value of this function acts as an energetic filter, through which can pass only individuals with lower fitness. Furthermore, this approach allows us to initialize a population of a sufficient (large) size. This method leads us to an improvement of algorithm convergence.

Title:

CASE-BASED APPROACH FOR EFFICIENT REDESIGN OF BUSINESS PROCESS

Author(s):

Farhi Marir

Abstract: Business Process Redesign (BPR) addresses the reengineering of one specific process within the firm. It helps rethinking a process in order to enhance its performance. Academics and Business practitioners have been developing methodologies to support the application of BPR principles. However, most methodologies generally lack actual guidance on deriving a process design threatening the success of BPR. In this paper, we suggest the use of a case-based reasoning technique (CBR) to support solving new problems by adapting previously successful solutions to similar problems. We investigate how CBR can support a BPR implementation. An implementation framework for BPR and the CBR’s cyclical process are used as a knowledge management technical support to serve for the effective reuses of redesign methods as a knowledge creation and sharing mechanism. This is developed in an attempt to improve the level of success of BPR implementation by using case stories.

Title:

TOWARDS HIGH DIMENSIONAL DATA MINING WITH BOOSTING OF PSVM AND VISUALIZATION TOOLS

Author(s):

Thanh-Nghi Do

Abstract: In the recent years support vector machines (SVM) have been successfully applied to a large number of applications. Training a SVM usually needs a quadratic programming, so that the learning task for large data sets requires large memory capacity and a long time. Proximal SVM proposed by Fung and Mangasarian is a new SVM formulation. It is very fast to train because it requires only the solution of a linear system. We have used the Sherman-Morrison-Woodbury formula to adapt the PSVM to process data sets with a very large number of attributes. We have extended this idea by applying boosting to PSVM for mining massive data sets with simultaneously very large number of data points and attributes. We have evaluated its performance on UCI, Twonorm, Ringnorm, Reuters-21578 and Ndc data sets. We also propose a new graphical tool for trying to interpret the results of the new algorithm by displaying the separating frontier between classes of the data set. This can help the user to deeply understand how the new algorithm can work.

Title:

ROBUST, GENERALIZED, QUICK AND EFFICIENT AGGLOMERATIVE CLUSTERING

Author(s):

Manolis Wallace

Abstract: Hierarchical approaches, which are dominated by the generic agglomerative clustering algorithm, are suitable for cases in which the count of distinct clusters in the data is not known a priori; this is not a rare case in real data. On the other hand, important problems are related to their application, such as susceptibility to errors in the initial steps that propagate all the way to the final output and high complexity. Finally, similarly to all other clustering techniques, their efficiency decreases as the dimensionality of their input increases. In this paper we propose a robust, generalized, quick and efficient extension to the generic agglomerative clustering process. Robust refers to the proposed approach's ability to overcome the classic algorithm's susceptibility to errors in the initial steps, generalized to its ability to simultaneously consider multiple distance metrics, quick to its suitability for application to larger datasets via the application of the computationally expensive components to only a subset of the available data samples and efficient to its ability to produce results that are comparable to those of trained classifiers, largely outperforming the generic agglomerative process.

Title:

TOWARDS VISUAL DATA MINING

Author(s):

Francois Poulet

Abstract: In this paper, we present our work in a new data mining approach called Visual Data Mining (VDM). This new approach tries to involve more intensively the user (being the data expert not a data mining or analysis specialist) in the data mining process and to increase the part of the visualisation in this process. The visualisation part can be increased with cooperative tools: the visualisation is used as a pre or post processing step of usual (automatic) data mining algorithms, or the visualisation tools can be used instead of the usual automatic algorithms. All these topics are addressed in this paper with an evaluation of the algorithms presented and a discussion of the interactive algorithms compared with automatic ones. All this work must be improved in order to allow the data specialists to use efficiently these kinds of algorithms to solve their problems.

Title:

HYBRID EXPERT SYSTEM FOR THE SELECTION OF RAPID PROTOTYPING PROCESSES

Author(s):

Farhi Marir

Abstract: A wide variety of rapid prototyping processes are available, each with different and unique features. Selecting the most suitable process for a given prototype can be difficult and costly if a mistake is made. In this paper, the design of a knowledge-based system to support the selection of a rapid prototyping process is presented. The method utilises a hybrid expert system, which is formulated to interrogate the acquired data streams from a rapid prototyping model simulator for the purpose of comparative studies with the knowledge base.

Title:

A CONNEXIONIST APPROACH FOR CASE BASED REASONING

Author(s):

José María de la Torre , Miguel Delgado , Eva Gibaja , Antonio B. Bailón

Abstract: Case Based Learning is an approach to automatic learning and reasoning based on the use of the knowledge gained in past experiences to solve new problems. To suggest a solution for a new problem it is necessary to search for similar problems in the base of problems for which we know their solutions. After selecting one or more similar problems their solutions are used to elaborate a suggested solution for the new problem. Associative memories recover patterns based on their similarity with a new input pattern. This behaviour made them useful to store the base of cases of a Case Based Reasoning system. In this paper we analyze the use of a special model of associative memory named CCLAM \cite{bailon2002a} with this objective. To test the potentiality of the tool we will discuss its use in a particular application: the detection of the ``health'' of a company.

Title:

INTELLIGENT VIRTUAL ENVIRONMENTS FOR TRAINING IN NUCLEAR POWER PLANTS

Author(s):

Pilar Herrero , Gonzalo Mendez , Angelica de Antonio

Abstract: Educational Virtual Environments are gaining popularity as tools to enhance student learning. These environments are often used to allow students to experience situations that would be difficult, costly, or impossible in the physical world. At the Technical University of Madrid we have developed several applications to explore the use of intelligent tutors in VR. In this paper we present two of these applications which have been used for training in radiological protection in Nuclear Power Plants (NPP). These applications are inhabited by avatars and/or agents which are continuously monitoring the state of the environment and manipulating it periodically through virtual motor actions. Our applications help students learn to perform physical, procedural tasks in some different risky areas of NPP.

Title:

BAYESIAN NETWORK CLASSIFIERS VERSUS K-NN CLASSIFIER USING SEQUENTIAL FEATURE SELECTION

Author(s):

Franz Pernkopf , Djamel  Bouchaffra

Abstract: The aim of this paper is to compare Bayesian network classifiers to the k-NN classifier based on a subset of features. This subset is established by means of sequential feature selection methods. Experimental results show that Bayesian network classifiers more often achieve a better classification rate on different data sets as selective k-NN classifiers. The $k$-NN classifier performs well in the case where the number of samples for learning the parameters of the Bayesian network is small. Bayesian network classifiers outperform selective k-NN methods in terms of memory requirements and computational demands. This paper demonstrates the strength of Bayesian networks for

Title:

G.R.E.E.N. AN EXPERT SYSTEM TO IDENTIFY GYMNOSPERMS

Author(s):

Antonio Bailón , Eva Gibaja

Abstract: The application of Artificial Intelligence techniques to the problem of botanical identification is not particularly widespread even less so on Internet. There are several interactive identification systems but they usually deal with raw knowledge so it appears that “research and development of web-based expert systems are still in their early stage” (Li et al., 2002). In this paper we present the G.R.E.E.N. (Gymnosperms Remote Expert Executed Over Networks) System as an Expert System for the identification of Iberian Gymnosperms which allows on-line uncertainty queries to be made. The System is operative and it can be consulted in http://drimys.ugr.es/experto/index.html.

Title:

ADAPTATIVE TECHNIQUES FOR THE HUMAN FACES DETECTION

Author(s):

João Fernando Marar , Danilo Nogueira Costa

Abstract: This paper presents results from an efficient approach to an automatic detection and extraction of human faces from images with any color, texture or objects in background, that consist in find isosceles triangles formed by the eyes and mouth.

Title:

OLIMPO SYSTEM WEB-TECNOLOGY FOR ELECTRONIC GOVERNMENT AND PACE WORLD

Author(s):

Andre Bortolon , Hugo Cesar Hoeschl , Tania Bueno , Eduardo Mattos , Vania Ferreira

Abstract: The paper describes the Olimpo System, a knowledge-based system that enables the user to access textual files and to retrieve information that is similar to the search context described by the user in natural language. The paper is focused on the innovation recently implemented on the system and its new features. A detailed description is presented about the search level and the similarity metrics used by the system. The methodology applied to the Olimpo system emphasises the use of information retrieval methods combined with the Artificial Intelligence technique named SCS (Structured Contextual Search).

Title:

DESIGN AND IMPLEMENTATION OF A SCALABLE FUZZY CASE-BASED MATCHING ENGINE

Author(s):

Jonas  Van Poucke , Bartel Van de Walle , Rami Hansenne , Veerle Van der Sluys

Abstract: We discuss the design and the implementation of a flexible and scalable fuzzy case-based matching engine. The engine’s flexible design is illustrated for two of its core components: the internal representation of cases by means of a variety of crisp and fuzzy data types, and the fuzzy operations to execute the ensuing case matching process. We investigate the scalability of the matching engine by a series of benchmark tests of increasing complexity, and find that the matching engine can manage an increasingly heavy load. This indicates that the engine can be used for demanding matching processes. We conclude by pointing at several applications in experimental electronic markets for which the matching engine currently is being put to use, and indicate avenues for future research.

Title:

INFORMED K-MEANS: A CLUSTERING PROCESS BIASED BY PRIOR KNOWLEDGE

Author(s):

Wagner Castilho , Hércules do Prado , Marcelo Ladeira

Abstract: Knowledge Discovery in Databases (KDD) is the process by which unknown and useful knowledge and information are extracted, by automatic or semi-automatic methods, from large amounts of data. Along the evolution of Information Technology and the rapid growth in the number and size of databases, the development of methodologies, techniques, and tools for data mining has become a major concern for researchers, and has led, in turn, to the development of applications in a variety of areas of human activity. About 1997, the processes and techniques associated with cluster analysis had begun to be researched with increasing intensity by the KDD community. Within the context of a model intended to support decisions based on cluster analysis, prior knowledge about the data structure and the application domain can be used as important constraints that lead to better results in the clusters’ configurations. This paper presents an application of cluster analysis in the area of public safety using a schema that takes into account the burden of prior knowledge acquired from statistical analysis on the data. Such an information was used as a bias for the k-means algorithm that was applied to identify the dactyloscopic (fingerprint) profile of criminals in the Brazilian capital, also known as Federal District. These results was then compared with a similar analysis that disregarded the prior knowledge. It is possible to observe that the analysis using prior knowledge generated clusters that are more coherent with the expert knowledge.

Title:

NEURAL NETWORK AND TIME SERIES AS TOOLS FOR SALES FORECASTING

Author(s):

Maria Emilia Camargo , Walter Priesnitz Filho , Angela Isabel dos Santos

Abstract: This paper presents the use of times series AutoRegressive Integrated Moving Average (ARIMA) ARIMA model with interventions, and neural network back-propagation model in analyzing the behavior of sales in a medium size enterprise located in Rio Grande do Sul Brazil for the period January 1979 December 2002. The forecasts obtained using the back-propagation model were found to be more accurate than those of ARIMA model with interventions.

Title:

A SYMBOLIC APPROACH TO LINGUISTIC NEGATION

Author(s):

Daniel PACHOLCZYK , Mazen EL-SAYED

Abstract: Negation processing is a challenging problem studied by a large number of researchers from different communities. This paper focuses on the linguistic negation rather than on the logical one. Our work is based on the main standard forms of linguistic negation interpretations represented as "x is not A". The reference frame associated with a standard form contains all its positive interpretations. The main goal of dealing with negation is the selection of one (or several) positive interpretation(s) associated with a negative sentence from its reference frame. The originality of our approach results from the fact that we do not research directly all affirmative interpretations of a negation, but we approximate its significance. We introduce two operators, one is optimistic and the other is pessimistic. They are defined according to rough set theory. By using the new negation formulation, we propose several generalizations of the Modus Ponens rule dealing with negative information. The new model is proposed within a symbolic many-valued predicate logic.

Title:

DYNAMIC INTEREST PROFILES: TRACKING USER INTRESTS USING PERSONAL INFORMATION

Author(s):

Joann Ruvolo , Justin Lessler , Vikas Krishna , Stefan Edlund

Abstract: When building applications it is usually the case that developers are forced to focus on “one size fits all” solutions. Customization is often burdensome for the user, or would be so complex that it would be unrealistic to ask an end user to undertake this task. In the areas of personal information management and collaboration there is no reason to accept this limitation, as there is a body of information about the user that reflects their interests: namely their personal documents. The Dynamic Interest Profile (DIP) is a system intended to track user interest to allow for the creation of more intelligent applications. In this paper we discuss our approach to implementing the DIP, challenges that this implementation presents, as well as the security and privacy concerns that the existence of such an application raises.

Title:

A FAST SCALE AND POSE INVARIANT FACE RECOGNITION METHOD

Author(s):

Dr younus Javed , Almas Anjum , Aamir Nadeem

Abstract: Abstract The high speed computing, database, networking technologies and sophisticated image processing methodologies have increased the topical significance of face recognition. The proposed system is a scale invariant face recognition model which works on reduced size of image to increase the speed and to reduce the complex computation .The approach transforms face images into a small set of characteristic features image matrices which are the principle component of the initial training set of images. On the basis of small sets of features a general matrix and difference matrices of the normalized images are formed which ultimately provide a base for the recognition of face. This model consists of two parts. The first part is conversion of RGB into gray image with averaging of RGB values and preprocessing of image. In the second part, the recognition is performed by projecting a test image to the face space spanned by general matrix, an error matrix is obtained and ultimately compared with difference matrices of all the training images and minimum error gives the recognized image. Recognition under reasonably varying conditions is achieved by training on a limited number of images with different poses and intensity levels. This approach has advantages over other face recognition schemes in its speed, simplicity, learning capacity and relative insensitivity to small or gradual changes of pose and intensity level in the face images and its size.

Title:

DYNAMIC NEGOTIATION FOR REAL-TIME MANUFACTURING EXECUTION

Author(s):

Li Qun Zhuang , Jing Bing Zhang , Bryan Tsong Jye Ng , Yi Zhi Zhao , Yue Tang

Abstract: This paper presents a dynamic negotiation framework for real-time execution in self-organised manufacturing environments. The negotiation strategies in this framework bridge the gap between distributed negotiation of self-interested agents and cooperative negotiation among agent groups. In particular, the proposed framework is based on the model of Performance and Cost for Manufacturing Execution (PCME). By forming the dynamic organisation called agent consortium, individual agent negotiates over the PCME in order to optimise the resource allocation under time constraints and uncertainty of job execution, and resolves the conflicts to fulfil the goal of the overall system. The ultimate goal of the framework is to reduce the negotiation time, make effective use of resources, adapt to the changes in execution and increase the throughput of the entire system. Experimental work based on PCME has been carried out to demonstrate the high performance of this approach despite unanticipated and dynamic changes in the manufacturing execution environments.

Title:

VISUALIZING SOFTWARE PROJECT ANALOGIES TO SUPPORT COST ESTIMATION

Author(s):

Martin Auer

Abstract: Software cost estimation is a crucial task in software project portfolio decisions like start scheduling, resource allocation, or bidding. A variety of estimation methods have been proposed to support estimators. Especially the analogy-based approach---based on a project's similarities with past projects---has been reported as both efficient and relatively transparent. However, its performance was typically measured automatically and the effect of human estimators' sanity checks was neglected. Thus, this paper proposes the visualization of high-dimensional software project portfolio data using multidimensional scaling (MDS). We (i) propose data preparation steps for an MDS visualization of software portfolio data, (ii) visualize several real-world industry project portfolio data sets and quantify the achieved approximation quality to assess the feasibility, and (iii) outline the expected benefits referring to the visualized portfolios' properties. This approach offers several promising benefits by enhancing portfolio data understanding and by providing intuitive means for estimators to assess an estimate's plausibility.

Title:

ORDER PLANNING DECISION SUPPORT SYSTEM FOR CUSTOMER DRIVEN MANUFACTURING: OVERVIEW OF MAIN SYSTEM REQUIREMENTS

Author(s):

Américo  Azevedo , Henrique  Proença

Abstract: An important goal in schedule production orders through a manufacturing facility is to assure that the work is completed as close as possible to its due date. Work that is late creates downstream delays, while early completion can be detrimental if storage space is limited. Production planning and control manufacturing is becoming more difficult as family products increase and quantity decreases. This paper presents an ongoing information system development that aims the production planning of special test tables equipment for automobile components manufacturers. The simulated based information system will be used to support planning and schedule activities; to compare and analyze the impact of planning rescheduling; to forecast the production completion date; to detect bottlenecks and to evaluate machines performance.

Title:

AN EXPERIENCE IN MANAGEMENT OF IMPRECISE SOIL DATABASES BY MEANS OF FUZZY ASSOCIATION RULES AND FUZZY APPROXIMATE DEPENDENCIES

Author(s):

J.M. Serrano , M. Sánchez-Marañón , Daniel Sánchez , M.A. Vila , G. Delgado , J. Calero

Abstract: In this work, we start from a database built with soil information from heterogeneous scientific sources (Local Soil Databases, LSDB). We call this an Aggregated Soil Database (ASDB). We are interested in determining if knowledge obtained by means of fuzzy association rules or fuzzy approximate dependencies can represent adequately expert knowledge for a soil scientific, familiarized with the study zone. A master relation between two soil attributes was selected and studied by the expert, in both ASDB and LSDB. Obtained results reveal that knowledge extracted by means of fuzzy data mining tools is significatively better than crisp one. Moreover, it is highly satisfactory from the soil scientific expert’s point of view, since it manages with more flexibility imprecision factors (IFASDB) commonly related to this type of information.

Title:

DECISION FOLLOW-UP SUPPORT MECHANISM BASED ON ASYNCHRONOUS COMMUNICATION

Author(s):

Wolfgang Prinz , Carla Valle

Abstract: Decision management and decision support systems are themes under investigation for several decades, and both research areas provided contributions for the quality of decision making processes. However, little work has been done in the area of decision follow-up, especially regarding decisions made during meetings. In this paper we analyse the concepts related to this problem and we propose a solution based on mechanisms supported by computer to assist the formalization of meeting outcomes, and to provide decision follow-up.

Title:

THE ORM MODEL AS A KNOWLEDGE REPRESENTATION FOR E-TUTORIAL SYSTEMS

Author(s):

tanaporn leelawattananon , suphamit chittayasothorn

Abstract: At present information technology plays important roles in teaching and learning activities. E-learning systems have the potential to reduce operating costs and train more people. Teachers and students do not have to be in the same place at the same time and the students have the opportunity to perform self-studies and self-evaluation using e-tutorial systems. E-learning systems could be considered expert systems in the sense that they provide expert advice in particular subjects of studies to students. The exploitation of knowledge base and knowledge representation techniques is therefore vital to the development of e-learning systems. This paper presents the development of a knowledge-based e-tutorial system that uses the Object Role Model (ORM) as its knowledge representation. The system provides Physics tutorials. It was implemented in Prolog and the knowledge base is on a relational database server.

Title:

IMPLEMENTING KNOWLEDGE MANAGEMENT TECHNIQUES FOR SECURITY PURPOSES

Author(s):

Ioannis  Drakopoulos , Petros Belsis , Stefanos Gritzalis , Christos Skourlas

Abstract: Due to its rapid growth, Information Systems Security becomes a new era of expertise, related to a vast quantity of knowledge. Exploiting all this knowledge becomes a difficult task, due to its heterogeneity. Knowledge Management (KM) on the other hand, becomes an expanding and promising discipline that has drawn considerable attention. In this paper we deploy our arguments about the benefits of KM techniques and their possible applications to assist security officers in improving their productivity and effectiveness. To prove this, we exploit possible technological prospects, and we present the architecture of a prototype developed to implement selected innovating KM components, embedding state-of-the-art multimedia java-based applications.

Title:

MAJORITY VOTING IN STABLE MARRIAGE PROBLEM WITH COUPLES

Author(s):

Tarmo Veskioja

Abstract: Providing centralised matching services can be viewed as a group decision support system (GDSS) for the participants to reach a stable matching solution. In the original stable marriage problem all the participants have to rank all members of the opposite party. Two variations for this problem allow for incomplete preference lists and ties in preferences. If members from one side are allowed to form couples and submit combined preferences, then the set of stable matchings may be empty (Roth et al., 1990). In that case it is necessary to use majority voting between matchings in a tournament. We propose a majority voting tournament method based on monotone systems and a value function for it. The proposed algorithm should minimize transitivity faults in tournament ranking.

Title:

OUTLIER DETECTION AND VISUALISATION

Author(s):

Lydia BOUDJELOUD , François POULET

Abstract: Abstract: The outlier detection problem has important applications in the field of fraud detection, network robustness analysis, and intrusion detection. Most such applications are high dimensional domains in which the data can contain hundreds of dimensions. However, in high dimensional space, the data is sparse and the notion of proximity fails to retain its meaningfulness. Many recent algorithms use heuristics such as genetic algorithms, the taboo search... in order to palliate these difficulties in high dimensional data. We present in this paper a new hybrid algorithm for outlier detection in high dimensional data. We evaluate the performances of the new algorithm on different high dimensional data sets, and visualise results for some data sets.

Title:

MULTI-AGENT ORGANISATIONAL MODEL FOR E-CONTACTING

Author(s):

Djamel  KHADRAOUI

Abstract: The paper covers the development and analysis tools, software and system architecture engineering, and development methodologies. It introduces the MOISE+ model for organizing agents inside a multi-agent system and it discussed the MOISE Java API. The Model of Organization for multI-agent System is a organizational model for Multi-Agent System seen under three points of view: structural, functional and deontic. In practical, this model is available as a JAVA component. The original contribution of the paper is the extension of the model to take into account the notion of sanctions. These are necessary in order to control the respect of normative specifications (obligation, permission, prohibition) of behaviors. The results of the generalized model is implemented on an eBusiness application dealing with eContracting.

Title:

MULTI-AGENT PROPOSITIONS TO MANAGE ORGANIZATIONAL KNOWLEDGE: POSITION PAPER CONCERNING A THREE-DIMENSIONAL RESEARCH PROJECT

Author(s):

Francisco  Guimarães , César  Rosa , Jorge  Louçã , Valmir  Meneses

Abstract: This paper presents the work in progress in a three-dimensional project, including the theoretical foundations and main goals of the lines of research incorporating our project: user modeling in a distributed cooperative system, interactive cooperation in a multi-agent structure, and knowledge representation in a cognitive agent architecture. These lines of research are complementary and share a main goal, to make propositions regarding the use of multi-agent systems in organizations, namely in what concerns support to decision making processes and, in a general way, knowledge management within organizations.

Title:

AN AGENT-BASED INFRASTRUCTURE FOR FACILITATING EVIDENCE-BASED HEALTH CARE

Author(s):

Jennifer Sampson

Abstract: Evidence-based medicine relies heavily on the timely dissemination of ‘best evidence’ to a wide audience of health practitioners (Atkins and Louw, 2000). However, finding, assimilating and using this information resource effectively can be difficult. In this paper we describe an infrastructure for facilitating evidence-based health care using Agora - a multi-agent system. This paper discusses our extensions to AGORA, and also describes issues for disseminating such medical knowledge via an adaptive, intelligent, distributed, mobile information service. We describe how an agent based approach can deliver clinical cases and diagnosis information to clinicians at point of care tailored to her/his needs. This research in progress is particularly important for the facilitating flow of information in health care.

Title:

AN ALGORITHM FOR LINEAR BILEVEL PROGRAMMING PROBLEMS

Author(s):

Jie Lu , Chenggen Shi

Abstract: For linear bilevel programming problems, the branch and bound algorithm is the most successful algorithm to deal with the complementary constraints arising from Kuhn-Tucker conditions. This paper proposes a new branch and bound algorithm for linear bilevel programming problems. Based on this result, a web-based bilevel decision support system is developed.


AREA 3 - Information Systems Analysis and Specification
 
Title:

PRIVACY CONCERNS IN INTERNET APPLICATIONS

Author(s):

Seev Neumann , Moshe Zviran

Abstract: The Merriam-Webster Dictionary defines privacy as “freedom from unauthorized intrusion”. While privacy has been a sensitive issue long before the advent of computers, the concern has been significantly elevated by the widespread use of large databases that make it easy to compile a dossier about an individual from many data sources. The problem of privacy has been further exacerbated by the fact that the Web makes it easy for new data to be automatically collected and added to databases and analyzed by sophisticated data mining tools and personalized marketing services. This study explores the nature of the privacy concern in detail, especially for the online environment. The objective of this study is to get a better understanding of the factors that can affect online privacy concerns and how this concern could affect the users’ behavior and the future of the Internet and electronic commerce.

Title:

A NEW VULNERABILITY TAXONOMY BASED ON PRIVILEGE ESCALATION

Author(s):

Yongzheng Zhang , Xiaochun Yun

Abstract: On the basis of analysis of research achievements of typical vulnerability taxonomies in the world, a privilege-escalating based vulnerability taxonomy with multidimensional quantitative attribute is presented in this paper. Then we give examples of three vulnerabilities to illustrate the characteristics of this taxonomy, and present the risk evaluation formula and ranks of the evaluation levels of risk.

Title:

A COMPARATIVE STUDY OF ELGAMAL BASED CRYPTOGRAPHIC ALGORITHMS

Author(s):

Ramzi Haraty , Hadi Otrok

Abstract: 038

Cryptography is the art or science of keeping messages secret. People mean different things when they talk about cryptography. Children play with toy ciphers and secret languages. However, these have little to do with real security and strong encryption. Strong encryption is the kind of encryption that can be used to protect information of real value against organized criminals, multinational corporations, and major governments. Strong encryption used to be only in the military domain; however, in the information society it has become one of the central tools for maintaining privacy and confidentiality. 038

As we move further into an information society, the technological means for global surveillance of millions of individual people are becoming available to major governments. Cryptography has become one of the main tools for privacy, trust, access control, electronic payments, corporate security, and countless other fields. 038

Perhaps the most striking development in the history of cryptography came in 1976 when Diffie and Hellman published $New$ $Directions$ $in$ $Cryptography$ [3]. Their work introduced the concept of public-key cryptography and provided a new method for key exchange. This method is based on the intractability of discrete logarithm problems. Although the authors had no practical realization of a public-key encryption scheme at the time, the idea was clear and it generated extensive interests and activities in the world of cryptography. One of the powerful and practical public-key schemes was produced by ElGamal in 1985 [4]. 038

El-Kassar and Awad [1][6] modified the ElGamal public-key encryption schemes from the domain of natural integers, $Z$, to two principal ideal domains, namely the domain of Gaussian integers, $Z[i]$, and the domain of the rings of polynomials over finite fields, $F[x]$, by extending the arithmetic needed for the modifications to these domains. 038

In this paper, we compare and evaluate the classical and modified ElGamal algorithms by implementing and running them on a computer. We investigate the issues of complexity, efficiency and reliability by running the programs with different sets of data. Moreover, comparisons will be done between these different algorithms given the same data as input. In addition, implementation of an attack algorithm will be presented. The attack algorithm consists of subroutines used to crack encrypted messages. This is done by applying certain mathematical concepts to find the private key of the encrypted message. After finding the key, it will be easy to decrypt the message. A study will be done using the results of running the attack algorithm to compare the security of the different classical and modified cryptographic algorithms.


Title:

ON INFORMATION SECURITY GUIDELINES FOR SMALL/MEDIUM ENTERPRISES

Author(s):

David Chapman , Leonid Smalov

Abstract: The adoption rate of Internet-based technologies by United Kingdom (UK) Small and Medium Enterprises (SMEs) is well-documented. Over several decades information security has evolved from early work such as the Bell La Padula (BLP) model toward widely disseminated Information Security Guidelines containing detailed advice. The overwhelming volume and level-of-detail provided often fails to address the information security requirements of SMEs. SMEs typically fail to implement effective Internet strategies due to lack of information security awareness, lack of technical skills and inadequate financial resources. Awareness of information security issues among SMEs is poor. The European Union supported ISA-EUNET Consortium has developed a set of best practices to support SMEs. We present a mapping of the Computer Security Expert Assist Team (CSEAT) Information Security Review Areas onto the Alliance for Electronic Business (AEB) web security guidelines as an example of a possible roadmap approach for SMEs to gain information security awareness.

Title:

ANALYSIS AND CONFIGURATION METHODOLOGY FOR VIDEO ON DEMAND SERVICES BASED ON MONITORING INFORMATION AND PREDICTION

Author(s):

Ángel Neira , Xabiel García Pañeda , David  Melendi Palacio , David Melendi , Roberto García , Víctor García

Abstract: This paper presents an analysis and configuration methodology for video-on-demand services. Usually, two entities take part in this kind of services: a network operator and a content provider. The former provides an Internet connection and manages servers and proxies, whereas the latter, normally a communication media, generates the provided contents. All their possibilities of configuration must be based on an accurate service behavioural analysis which evaluates the quality and the quantity of resources, contents and subscribers. This analysis can be performed using monitoring information and predictions of a near future behaviour established by managers. To formalize both analysis and configuration, a methodology must be developed in order to help the service managers to attain a good performance and at the same time, make a profit for their companies.

Title:

DESCRIBING SOFTWARE-INTENSIVE PROCESS ARCHITECTURES USING A UML-BASED ADL

Author(s):

Ilham ALLOUI , Flavio OQUENDO

Abstract: Many Architecture Description Languages (ADLs) have been proposed in the software architecture community, with several competing notations, each of them bringing its own body of specification languages and analysis techniques. The aim of all is to reduce the costs of error detection and repair while providing adequate abstractions for modelling large software-intensive systems and establishing properties of interest. However, there now exists a large consensus to standardise on notations and methods for software analysis and design as standardisation provides an economy of scale that results in various and better tools, better interoperability between tools, more available developers skilled in using the standard notation, and lower training costs. Therefore software-intensive process architectures can be relevantly described using a standard-compliant design notation. Among such notations, the UML modelling language that on one side makes use of visual notations and on the other side, is an emerging standard software design language and a starting point for bringing architectural modelling into industrial use. This paper presents an architecture-centred UML-based notation to describe software process architectures. The architectural concepts have already been formally defined in an Architecture Description textual Language. The notation is illustrated by a business-to-business process application. The main contribution of this work is to show that UML with its large and extensible set of predefined constructs imposes itself as a relevant candidate to be extended with the necessary architectural concepts and customisation to model software-intensive processes. The work presented is being developed and validated within the framework of the X IST 5 ongoing European project.

Title:

U_VBOOM : UNIFIED ANALYSIS AND DESIGN PROCESS BASED ON THE VIEWPOINT CONCEPT

Author(s):

Hair Abdellatif

Abstract: The introduction of viewpoint in object-oriented design provides several improvements in modeling complex systems. In fact, it enables the users to build a unique model accessible by different users with various points of view, instead of building several sub-models whose management is too hard to complete. Those concepts of view and viewpoint were implemented by VBOOL, the language which propose a new relationship "the visibility". VBOOM, the analyze/design method, integrates those concepts in an object-oriented modeling. The aims of this work are, firstly to propose a new representation of the visibility relationship of VBOOL in UML standard language for modeling and specifying object-oriented systems. Secondly, to complete UML by an oriented viewpoint method to get a complete software engineering process. The definition of this method is based on VBOOM method. This method is called U_VBOOM, which represents an adaptation of VBOOM in UML. The new representation of the visibility relationship encourages the multi-targets code generation and improve the process of development proposed by the VBOOM method.

Title:

TESTING SOFTWARE SYSTEMS FROM A USER'S PERSPECTIVE

Author(s):

Thomas Thelin

Abstract: An important attribute to whether a software system will be used is the satisfaction of the users during usage. In order to fulfil the users’ requirements during development, software inspection and testing are two important activities that are used. Software inspection is used in the first phases of development and testing is used after the system has been implemented. Several inspection and testing techniques have been developed, and some of these validate the software from the perspective of the users. Statistical usage testing (SUT) is one of these techniques, which is used to test a software product from a user's point of view. In SUT, usage models are designed to anticipate the future usage and then test cases are developed from the models. The development of test cases from the usage model can be made automatically by using a tool. This paper focuses on verification and validation from a usage perspective and presents a novel tool for SUT called MaTeLo. The purpose of the tool is to automatically produce test cases based on usage models, and to calculate important quality metrics like reliability. Furthermore, this paper describes the empirical evaluation of the tool and how SUT relates to inspection and estimation techniques with user focus.

Title:

WORKFLOW ACCESS CONTROL FROM A BUSINESS PERSPECTIVE

Author(s):

Dulce Domingos

Abstract: Workflow management systems are increasingly being used to support business processes. Methodologies have been proposed in order to derive workflow process definitions from business models. However, these methodologies do not comprise access control aspects. In this paper we propose an extension to the Work Analysis Refinement Modelling (WARM) methodology, which also enables to determine workflow access control information from the business process model. This is done by identifying useful information from business process models and showing how it can be refined to derive access control information. Our approach reduces the effort required to define the workflow access control, ensures that authorization rules are directly related to the business and aligns access control with the information system architecture that implements the business process.

Title:

USING SECURITY ATTACK SCENARIOS TO ANALYSE SECURITY DURING INFORMATION SYSTEMS DESIGN

Author(s):

Paolo  Giorgini , Haralambos Mouratidis , Gordon  Manson

Abstract: It has been widely argued in the literature that security concerns should be integrated with software engineering practices. However, only recently work has been initiated towards this direction. Most of this work, however, only considers how security can be analysed during the development lifecycles and not how the security of an information system can be tested during the analysis and design stages. In this paper we present results from the development of a technique, which is based on the use of scenarios, to test the reaction of an information system against potential security attacks.

Title:

METRICS FOR DYNAMICS: HOW TO IMPROVE THE BEHAVIOUR OF AN OBJECT INFORMATION SYSTEM

Author(s):

Maria Jose Escalona , Jean-Louis Cavarero

Abstract: If we ask about which is the main difference between modelling a system using a traditional model like the entity relationship model or an object oriented model, from our point of view the answer is that, in the first one, the processes are not located somewhere, and, in the second one, the processes (operations or methods) are encapsulated in classes. The choice of the right classes to home every operation is essential for the behaviour of the system. It is totally useless to design a well built system, according to a lot of statics metrics, if the system does not run well after. In other words, dynamic metrics allowing to evaluate the behaviour of a system when it runs are much more useful than any static metrics used to tell if the system is correctly built or not. According to this, we propose in this paper, a new approach to evaluate a priori the behaviour of a system, by taking into account the notion of event cost and the notion of time (which is obviously essential). The final goal of this approach is to deliver information on the way operations have to be placed in classes in order to get better performances when the system is running. However, the proposal of metrics is of no value if their practical use is not demonstrated, either by means of case studies taken from real projects or by controlled experiments. For this reason, an optimisation tool is being under construction in order to provide solutions to this problem.

Title:

ALIGNING BUSINESS PROCESS MODELING AND SOFTWARE SPECIFICATION IN A COMPONENT-BASED WAY, THE ADVANTAGES OF SDBC

Author(s):

Boris Shishkov , Jan L.G. Dietz

Abstract: One frequent cause of software project failure is the mismatch between the (business) requirements and the actual functionality of the delivered (software) application. In this paper, some popular methods (as well as their strengths and shortcomings) that address the mentioned problem are briefly outlined and an approach is proposed, for design of software, basing consistently this design on prior business process modeling. The alignment between these two tasks is realized in a component-based way, by deriving the software model from identified (generic) business components, thus - taking advantage of the benefits of object-orientation. The paper introduces not only the concepts of the approach but also elaborated views on how it could be implemented using particular software design and business process modeling techniques. A way to implement the approach is through UML - the standard language for designing software. The suggested approach is expected to be a useful contribution to the knowledge on aligning business process modeling and software design.

Title:

A NEW MODEL TO MANAGE IDS ALERTS

Author(s):

Walter Godoy Junior , Marco Bonato

Abstract: The goal of this paper is to present a new model to reduce the alerts generated by an IDS analyzer. This model allows the administrator to analyze only the messages that really generate risks for an environment or machine. This is very important when you have a complex environment with a lot of machines with many services in them.

Title:

CONSTRAINT-GUIDED ENTERPRISE PORTALS

Author(s):

Frank Kriwaczek , Christopher Hogger

Abstract: It is shown how an enterprise portal, supporting a community of users discharging roles expressed as combinations of plans and constraints, can be usefully guided by a constraint processor. In particular, constraint logic programming on finite domains provides the users with useful insights regarding their possible work schedules. Constraints assist also in shaping the electronic artefacts created and transmitted by the users. The implementation is supported by mechanisms for assigning and updating roles and for assisting the search for remedies in the case of constraint failure.

Title:

MODEL CHECKING AN OBJECT-ORIENTED DESIGN

Author(s):

Simon C Stanton , Vishv Malhotra

Abstract: Object classes are the building blocks in developing object-oriented software. The design methodologies have focused on methods, tools and representations to build classes taking advantages of inheritance and encapsulation properties. The guiding principle being that if all classes are correctly constructed a system consisting of objects in these classes will be correct. Efforts to include object constraints in the object-oriented programs have not attained the role commensurate with the role invariants play in the traditional imperative programs in understanding the programs and in enforcing correctness properties. The paper describes use of a model checker to establish correctness of an object-oriented design.

Title:

A TECHNIQUE FOR INTRODUCING STEREOTYPES INTO UML TOOLS

Author(s):

Miroslaw  Staron , Ludwik Kuzniarz

Abstract: The Unified Modeling Language is a general-purpose, visual object-oriented modeling language, which can be used for a variety of purposes. However, the usage of the language for specific purposes and needs can be done by customization with the help of the built-in extension mechanisms. The customization must be supported by the tools used to produce models in the software development. This paper elaborates on the capabilities of UML tools which results in identification of some problems. The paper proposes an alternative way of introducing stereotypes, which is independent of UML tools used, based on the Extensible Metadata Interchange (XMI) format and related XML technologies. The method is compared with the introduction of stereotypes directly into UML tools by an example design.

Title:

THE COMPONENT BASED PROGRAMMING MODEL FOR LINUX (CBPM)

Author(s):

Ali Raza , Omer Muhammad , Sikander Hayat , Imran Gondal

Abstract: Contemporary component model development is getting more and more important in software industry. Academic world is spending long time on development and refinement of their component models and rarely do they consider the alternative of not using a bridge. We propose and have implemented the Component based Programming Model (CBPM) for Linux which removes the usage of bridge while conforming to Component Object Model. CBPM aims to lower software development cost by providing sophisticated facilities for Component Object Model (COM) based component reuse on Linux. CBPM also focuses on eliminating the bridging overheads for using COM component. CBPM defines a standard for component interoperability, is not dependent on any particular programming language and is extensible.

Title:

EFFECTIVE XML REPRESENTATION FOR SPOKEN LANGUAGE IN ORGANISATIONS

Author(s):

Philip Windridge , Dali Dong , Rodney Clarke

Abstract: Spoken Language can be used to provide insights into organisational processes, unfortunately transcription and coding stages are very time consuming and expensive. The concept of partial transcription and coding is proposed in which spoken language is indexed prior to any subsequent processing. The functional linguistic theory of texture is used to describe the effects of partial transcription on observational records. The standard used to encode transcript context and metadata is called CHAT, but a previous XML schema developed to implement it contains design assumptions that make it difficult to support partial transcription for example. This paper describes a more effective XML schema that overcomes many of these problems and is intended for use in applications that support the rapid development of spoken language deliverables.

Title:

ONTOLOGY MODELING TOOL USING CONCEPT DICTIONARY AND INFERENCE

Author(s):

Yoichi Hiramatsu

Abstract: The usefulness of ontology is strongly dependent on the knowledge representation policy and its maintenance. The subject of knowledge representation and modeling tool has been one of the exciting themes among ontology scientists. Some ontology editing tools were born and grew up in the field of expert system and others designed originally by ontology research groups. Key features of the newly implemented tool are: (a) reference to concept dictionary (EDR and WordNet) to find out quickly the semantics of the words, and (b) use of inference algorithm provided by Schank’s Memory Organization Package. Satisfactory results were obtained in the application of ontology modeled by the present tool. The paper describes the implementation of the modeling tool and its effectiveness in solving some actual problems of enterprise integration.

Title:

OPEN SOURCE VS. CLOSED SOURCE

Author(s):

Vidyasagar Potdar , Elizabeth Chang , Ljiljana  Brankovic

Abstract: Open source software development represents a fundamentally new concept in the field of software engineering. Comparing to traditional software engineering approach, we found this approach is absolutely reversed. Open source development and delivery occurs on Internet time. Developers are not confined to a geographic area. They work voluntarily on a project of their choice; they do not have to join a particular project just because it needs more developers or the project has a high degree of urgency. Developers work for peer-recognition and self-satisfaction. In the open source community, each and every project has an equal priority. Software designed under open source is not released until the project owner thinks that the software has reached a functional stage. One of the success stories is the Linux operating system. Open Source software is always in an evolutionary stage: it never reaches a final stage. As new requirements emerge the software is enhanced by the user/developers. In this paper, we give an introduction to the insights of open source software development. We then elucidate the perceived benefits and point out the differences between open source and closed source software development approaches. At the end we propose a new model for open source software development.

Title:

USING WORKFLOW TECHNOLOGY: INTEGRATING EXISTING ENTERPRISE SYSTEMS WITH WORKFLOW TECHNOLOGY

Author(s):

Jeanne Stynes , Patrick Rushe

Abstract: Reducing costs and reducing time to market are two major keys to survival in the software market. Workflow reduces costs and time dramatically where applications involve the passage of work between recipients in order to meet certain business objectives. New projects in this area often use workflow technology. However, workflow’s applicability is often overlooked where developers are working on maintaining or upgrading existing systems. This paper discusses the work involved in integrating an existing system with a workflow management system, and examines the benefits of incorporating workflow into existing systems.

Title: