Anatella

ANATELLA
DATA IN MOTION

Change the way you think about your data.

WELCOME TO ANATELLA

What is Anatella?

At first sight, Anatella looks like a user-friendly data management (ETL) tool but it’s much more than that. Anatella is the center of the TIMi Analytical platform.

Read more

Key features

With Anatella, you can solve the most advanced machine learning problems. Anatella is also a collaboration platform that allows easy collaboration between “expert coders” and less technical “business” users.

All functionnalities

Support and training

Quick start guide, video tutorials, documentation, training sessions and other ressources for Anatella.

Support center

Download Anatella

Download Anatella with the “TIMi Community Edition” or the “Business Edition trial”. Free to use. No limit, no constraint.

Download Anatella

DATA MANAGEMENT MADE EASY

A platform built for Citizen Data Scientists

Forget complex, costly, lengthy and code intensive data management procedure.
Thanks to Anatella, the business-oriented users can easily collaborate
with the most technical users to produce outstanding value.

2018-12-18_150519

PERFORMANCE

Never wait anymore for a data transformation. Anatella handles tables with billions of rows and thousands of columns even on small infrastructures.

EFFICIENCY

Develop complex data transformations faster using an intuitive interface that requires no code and save your data transformations for easy re-use.

VERSATILITY

Easily extract, clean, aggregate and join all kinds of datasets and inject them into a RDBMS, a BI tool, a modeling tool or into R/Python.

MODULARITY

Create or import customised tranformations in R, Python and Javascript using the built-in plugin system and the collaborative framework.

ANATELLA FUNCTIONALITIES

Fast data management

This typically includes: Extract different Datasets from various storage types and various locations ; Cleaning and Validating your Dataset; Compute many different aggregates on your datasets ; Join several datasets together (you know: those two databases that were never supposed to be joined together because they lack a proper common key?) ; Injecting your datasets into a RDBMS, into a BI tool, into a modeling tool, into R/Python.

Read more

Data quality

With Anatella, you can easily perform any data quality and data cleaning tasks on large data volumes (handle tables with several billions rows using only one laptop).

Read more

R and Python integration

All data scientists have their preferred set of tools and libraries that they are used to. Why impose a specific language or library? Anatella can seamlessly integrate inside the same data transformation graph many pieces of code programmed in either R, Python or Javascript. It’s the ultimate flexibility and it doesn’t even arrive at a cost: Anatella has one of the fastest (maybe the fastest) R/Python engine.

Read more

Collaboration framework

Anatella offers an abstraction layer around the code that allows analysts and coders to communicate smoothly and make progress in a collaborative way. Thanks to the abstraction layer, the analyst don’t see any line of code while the coder are still writing code. Both groups can still think together and bring meaningful contributions on how to solve the problem at hand.

Read more

Simple “Static” Reporting

Anatella is fully integrated with the MS-Office Suite. In a few mouse clicks, Anatella can read, write or update MS-Excel .xlsx files at very high speed (using its own proprietary code). Anatella allows you to automatically update all the charts and graphs of all your MS Office reports (without using any un-maintainable VBA scripts!). For example, Anatella can automatically update the charts in your Powerpoint report, for easy reporting to your C-Level.

Read more

BI integration and OLAP

Anatella is the ideal companion to any & all BI tools (such as: Tableau, Qlik, Kibella and Kibana). Anatella automatically generates and updates the datasets used inside your BI solution. Anatella can natively create at high speed the internal, proprietary file format used by Tableau, Qlik, Kibella and Kibana. It means that refreshing the data “behind” your dashboard has never been so easy.

Read more

Built for Machine Learning

Anatella includes natively strong Text Mining capabilities that can be even more extended thanks to the R/Python integration. Anatella also includes many functionalities tailored for predictive modeling such as feature engineering, meta-data-free transformations, graph-mining, etc.

Read more

Built For Iterative Work

BI, Analytics and Predictive Analytics projects are characterized by their “exploratory” nature. That’s why Anatella allows quick iteration over different variations of your data transformations.

Read more

Graph Mining

Anatella is able to run SNA (Social Network Analytics) and Graph Mining algorithms on the very large graphs available inside the telecom & banking companies. Using Graph Mining algorithm, detect Communities, detect Social Leaders, compute Graph kernels, etc. on graphs with several dozens millions of nodes and several billions of arc.

Read more

Data Lake

Anatella not only solves the “data in motion” problem but it also solves the “data storage problem”. Thanks to two proprietary file formats (.gel_anatella and .cgel_anatella), Anatella can store vast amount of data without requiring much diskspace. For example, you can store a 5TB RDBMS data base in less than 100GB of hard drive space and still have incredible I/O performances while manipulating your datasets.

Furthermore, from within Anatella, the Hadoop HDFS drive is seen as a “normal”, local drive (e.g. you can read/write/browse your HDFS drive “as if” it was a local drive): It means that it’s as easy to store all your dataset files on a HDFS drive, as to store your files on a highly efficient NAS/SAN/RAID6/”C:” drive. An HDFS drive offers unlimited storage at a low price.

Read more

Hadoop Integration

Thanks to the direct Hadoop Integration, creating a centralized Data Lake on the HDFS drive with Anatella is a breeze. All the tools inside the Hadoop ecosystem are reading and writing their data inside files inside the HDFS drive. The file format of the files saved on an HDFS drive are, typically: Text files, .parquet files, .avro files (and also .gel_anatella or .cgel_anatella files when using Anatella)

Anatella use low-level C code to read/write parquet files directly from/to a HDFS drive. This makes Anatella several times faster than any other tool inside the Hadoop ecosystem.

Read more

Data Warehousing

Anatella connects natively to any “legacy” data source (e.g. old SAS .sas7bdat file, old AS-400 Main Frame files created in Kobol, old DBase or FoxPro databases, etc.) and it also connects to the “new wave” of Big Data solution (HDFS) and IoT solutions (real-time). This means that Anatella is the perfect tool to integrate all these different (new and old) technologies.

For example, with Anatella, in a few mouse click, you can feed all your “HDFS datasets” inside your Data WareHouse (based on Oracle, Teradata, etc.) and the other way around.

Read more

Straightforward&Fast Industrialization of R&D findings

Once a data transformation is ready for industrialization, you can deploy it on your (pre-)production server/cluster in a few mouse-clicks.

Anatella scripts can be executed in command line, via REST API or via any scheduler such as Jenkins. It runs on both Windows and Linux servers.

Read more

IoT integration and Real-Time streaming

Anatella data transformations can work both in classical “batch mode”, but also in real-time streaming mode (although not all data transformation operators are available when working in streaming mode). This means that direct bi-directional connections to common IoT brokers such as Kafka, RabbitMQ, Mosqitto is straightforward and easy. Thanks to the load-balancing included in such brokers, Anatella can sustain a practically unlimited amount of simultaneous connections (just add more nodes if you need more speed).

Read more

No Cloud, No In-Memory

The selection of technologies used in Anatella is based on efficiency rather than popularity. This means that Anatella is not an “in-memory” solution. The dataset sizes that you can manipulate with Anatella are only limited by the size of your (local or HDFS) drive and not by your RAM. This limit is furthermore reduced thanks the usage of a highly-compressed proprietary file format available in Anatella (.gel_anatella and .cgel_anatella files) that allows to store Terabytes of data in a few dozen gigabytes.

Anatella doesn’t rely on a large cloud infrastructure to provide the necessary computing power to process these vast amount of data. Of course, you can still use Cloud/Distributed computation or storage with Anatella. Most of the time, Anatella’s engine is so efficient (both in terms of computation and storage) that one ordinary laptop is more than enough to handle all the tasks at hand.

2 Examples of fast ETL on a laptop

Non intrusive & Easy deployment

You can install and run your data-transformation scripts on any Windows-based PC (or inside “Vine” on Linux).

Anatella has a small, automated & easy wizard-based installation system that installs Anatella in less than a minute. You can run your Anatella data-transformation Scripts on any PC, even with restricted system privileges.

There even exists a portable version of Anatella that requires no installation (i.e. it’s a simple ZIP file to unzip). The portable version guarantees that no bloatware will ever be installed on your machine when “installing” Anatella since there is no real installation: i.e. All you do is unzipping a ZIP file. The portable version is also very handy if you are working as an external consultant at your client site and want to quickly process some data. With the portable version of Anatella on a USB stick, you can run your Anatella data-transformation graphs on any PC, even with very restricted system privileges. There is no excuse to not use Anatella everywhere!

Low licensing cost

There are no licensing fee based on the volume of processed data (i.e. There is no “data tax”). Furthermore, the Community Edition of Anatella is totally free and covers more than 95% of the usual business cases.

You only pay for additional functionalities, maintenance and support. Not for volume. This type of licensing model is important for growing businesses, with an increasing amount of data. If your business grows, with Anatella, you can still process your data as often as you need.

Testimonials

Bouygues Telecom
Telecom operator in France

“The optimal solution to extract advanced Social Network Algorithms metrics out of gigantic social data graphs.”

VOO
Telecom operator in Belgium

“We reduced by 10% the churn on the customer-segment with the highest churn rate.”

Digicel
Mobile network provider

“TIMi framework includes a very flexible ETL tool that swiftly handles terabyte-size datasets on an ordinary desktop computer.”