Capri Summary


Corporate Intellect's Capri software is a model for sophisticated data mining. It is a leading Sequence Detection Algorithm currently installed in over 60 companies throughout the world.

Sequence Detection Algorithms discover patterns consisting of frequently co-occurring events in time-stamped data. The input to the algorithm is a set of objects, each identified by a "Primary Key". Each object has a number of events attributed to it. An event may be a change in a commodity price, the purchase of a product, the accessing of a web page, and so on. A "Secondary Key", often a date-time stamp, defines the order in which the events occurred.
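
As an illustration of this input model, the sketch below groups events by primary key and orders them by secondary key. It is not Capri's own input format; the row layout, the sample data and the function name build_sequences are assumptions made for the example.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical input rows: (primary_key, secondary_key, event).
rows = [
    ("cust-2", datetime(2001, 8, 10, 9, 35), "buy:printer"),
    ("cust-1", datetime(2001, 8, 9, 9, 35), "view:home"),
    ("cust-1", datetime(2001, 8, 9, 9, 40), "buy:laptop"),
    ("cust-2", datetime(2001, 8, 9, 9, 35), "buy:ink"),
]


def build_sequences(rows):
    """Group events by primary key and order them by secondary key."""
    grouped = defaultdict(list)
    for primary_key, secondary_key, event in rows:
        grouped[primary_key].append((secondary_key, event))
    return {
        key: [event for _, event in sorted(events, key=lambda pair: pair[0])]
        for key, events in grouped.items()
    }


sequences = build_sequences(rows)
```

Each object then carries one time-ordered list of events, which is the form a sequence detection algorithm searches for recurring patterns.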

Typical example applications to which Capri has been applied include:

• Money Laundering Detection
• Basket Analysis
• Web Analytics/Personalisation
• Price Discovery Prediction
• Pharmaceutical Research

Capri's Uniqueness


Because it is based on the Apriori association algorithm, Capri finds association rules in the same way as algorithms such as GRI and Apriori. The strength of Capri, however, is its ability to discover associations over time. Associations between items that occur together across a set of time-ordered records are known as sequences.
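
To make the idea of a sequence concrete, the sketch below counts how often an ordered pattern occurs across a set of objects, which is the usual notion of support in Apriori-style sequence mining. It is a minimal illustration, not Capri's algorithm; the function names and sample data are assumptions.

```python
def contains_in_order(events, pattern):
    """True if the pattern's items appear in events in the same order (gaps allowed)."""
    remaining = iter(events)
    # Each membership test consumes the iterator up to the match,
    # so later items must be found after earlier ones.
    return all(item in remaining for item in pattern)


def support(sequences, pattern):
    """Fraction of objects whose event list contains the pattern in order."""
    if not sequences:
        return 0.0
    hits = sum(contains_in_order(events, pattern) for events in sequences.values())
    return hits / len(sequences)


# Hypothetical time-ordered event lists, keyed by primary key.
example = {
    "a": ["x", "y", "z"],
    "b": ["y", "x"],
    "c": ["x", "z", "y"],
}
x_then_y = support(example, ["x", "y"])  # objects "a" and "c" match
y_then_x = support(example, ["y", "x"])  # only object "b" matches
```

Unlike a plain association rule, the pattern "x then y" and the pattern "y then x" have different support, because order matters.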

Capri also enables you to incorporate knowledge of what you are looking for in sequences by allowing you to specify features such as the start and end items for a sequence, sequence length, and time constraints like maximum time allowed between items in the sequence. See Capri Key Features.
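
The constraints listed above can be sketched as a simple filter over candidate sequences. This is an assumption-laden illustration, not Capri's interface: the class SequenceConstraints, its field names and the function satisfies are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class SequenceConstraints:
    """Hypothetical container for the constraints described in the text."""
    start_item: Optional[str] = None
    end_item: Optional[str] = None
    max_length: Optional[int] = None
    max_gap: Optional[timedelta] = None  # maximum time between consecutive items


def satisfies(sequence, constraints):
    """Check a time-ordered list of (timestamp, item) pairs against the constraints."""
    if not sequence:
        return False
    items = [item for _, item in sequence]
    if constraints.start_item is not None and items[0] != constraints.start_item:
        return False
    if constraints.end_item is not None and items[-1] != constraints.end_item:
        return False
    if constraints.max_length is not None and len(items) > constraints.max_length:
        return False
    if constraints.max_gap is not None:
        times = [timestamp for timestamp, _ in sequence]
        for earlier, later in zip(times, times[1:]):
            if later - earlier > constraints.max_gap:
                return False
    return True


t0 = datetime(2001, 8, 9, 9, 35)
candidate = [
    (t0, "login"),
    (t0 + timedelta(minutes=5), "transfer"),
    (t0 + timedelta(minutes=8), "logout"),
]
rules = SequenceConstraints(
    start_item="login",
    end_item="logout",
    max_length=4,
    max_gap=timedelta(minutes=10),
)
accepted = satisfies(candidate, rules)
```

Applying such constraints during discovery, rather than afterwards, reduces the number of candidate sequences that have to be examined.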

Capri Key Features


Capri has a number of unique features that have helped it maintain its innovative edge over other Sequence Detection Algorithms. These include, but are not limited to:

Highly Scalable
Capri has been successfully applied to gigabytes of data consisting of millions of records.

Ability to incorporate Domain Knowledge
The user can define a taxonomy on each of the attributes describing the events from which the sequence patterns are to be discovered. This allows sequences to be discovered at varying levels of generalisation.
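
A taxonomy of this kind can be pictured as a child-to-parent mapping that lifts events to a more general level before patterns are searched for. The sketch below is illustrative only; the taxonomy contents and function names are assumptions, and the source does not describe how Capri represents taxonomies internally.

```python
# Hypothetical taxonomy expressed as child -> parent.
taxonomy = {
    "espresso": "coffee",
    "latte": "coffee",
    "coffee": "hot drink",
    "green tea": "tea",
    "tea": "hot drink",
}


def generalise(item, taxonomy, levels=1):
    """Walk up the taxonomy by `levels` steps, stopping at the root."""
    for _ in range(levels):
        if item not in taxonomy:
            break
        item = taxonomy[item]
    return item


def generalise_sequence(events, taxonomy, levels=1):
    """Replace every event in a sequence with its ancestor `levels` steps up."""
    return [generalise(event, taxonomy, levels) for event in events]


one_level = generalise_sequence(["espresso", "green tea"], taxonomy)
two_levels = generalise_sequence(["espresso", "green tea"], taxonomy, levels=2)
```

A pattern that is too rare at the level of individual products may become frequent once its items are generalised to their categories.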

Template description language
Like other algorithms in the Apriori family, sequence discovery algorithms can produce a very large number of sequences. Capri provides an XML-based language for describing the patterns of interest in a specific discovery run. Only sequences matching the defined templates are discovered.

Discrete and Continuous Events
Sequence detection algorithms generally deal with discrete events. Capri has been extended to work with continuous-valued series and takes input data in the form:

Date/Time           Commodity 1 Price   Commodity 2 Price
08/09/2001 09:35    27.59209            25.61969
08/10/2001 09:35    28.11783            26.09171
08/13/2001 09:35    27.85907            26.09961
08/14/2001 09:35    27.94062            25.87783
08/15/2001 09:35    27.62899            26.04845
08/16/2001 09:35    27.37411            25.6124
08/17/2001 09:35    26.85992            25.29884

This makes Capri applicable to the financial sector where data is generally a continuous valued series.
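
The source does not say how Capri treats continuous values internally. One common way to make such a series usable by a sequence detection algorithm is to convert each price change into a discrete event, as in the sketch below. The threshold of 0.5% and the event labels are assumptions chosen for the example; the prices are the Commodity 1 values from the sample input.

```python
def discretise(prices, threshold=0.005):
    """Turn consecutive prices into 'up', 'down' or 'flat' events.

    A relative change larger than `threshold` in either direction counts
    as a move; anything smaller is treated as flat.
    """
    events = []
    for previous, current in zip(prices, prices[1:]):
        change = (current - previous) / previous
        if change > threshold:
            events.append("up")
        elif change < -threshold:
            events.append("down")
        else:
            events.append("flat")
    return events


commodity_1 = [27.59209, 28.11783, 27.85907, 27.94062, 27.62899, 27.37411, 26.85992]
events = discretise(commodity_1)
```

The resulting event list has one entry fewer than the price series, because each event describes the change between two consecutive observations.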

Multi-Variate sequence detection
Capri can discover sequence patterns in data where more than one attribute describes each event and the goal is to find patterns relating to the occurrence of particular attribute values within each event.

PMML and XML
Capri provides XML representations of the data input and results that ease its integration with other software systems.

PMML (Predictive Model Markup Language) is a standard developed by Corporate Intellect in collaboration with Oracle, IBM, SPSS, SAS, Magnify and other key data-mining vendors. The goal of PMML is to represent knowledge discovered by data mining algorithms in an open, XML-based standard, enabling models to be exchanged between data mining vendors, knowledge generation providers and knowledge consumer software systems in general.

Complex Sequence Pattern Discovery
Capri provides a wide range of parameters, giving users the flexibility to discover more specific sequence types based on their needs.

Unique Visualisation
Visualisation of the output sequences is enabled through a 3-dimensional cone-tree visualiser that describes the event space and displays individual sequences on the structure presented. The event space can be defined by the user in XML format or extracted from the sequences discovered.

Capri Userbase



Capri provides a valuable data-mining algorithm for the following vertical markets:

• Telecommunications
• Health Care
• Banking and Finance
• Insurance
• Re-Insurance
• Manufacturing
• Retail
• Consumer Packaged Goods
• Market Research
• Public Sector
• Academia

Capri System Requirements



Capri is supported on the Sun Solaris and Microsoft Windows platforms. The system requirements for installing and running Capri are:

Hardware: Pentium-compatible processor or higher and a monitor with 1024 x 768 resolution or higher (support for 65,536 colors is recommended). A CD-ROM drive is also required for installation.

Operating system: Windows 98, Windows 2000, or Windows NT 4.0 with Service Pack 6 or higher. Solaris 2.6 is also supported.

Minimum free disk space: 5 MB is required for Capri.

Minimum RAM: 128 MB or more.
