DATA IN MOTION
Change the way you think about your data.
WELCOME TO ANATELLA
What is Anatella?
At first sight, Anatella looks like a user-friendly data management (ETL) tool but it’s much more than that. Anatella is the center of the TIMi Analytical platform.Read more
With Anatella, you can solve the most advanced machine learning problems. Anatella is also a collaboration platform that allows easy collaboration between “expert coders” and less technical “business” users.All functionnalities
Support and training
Quick start guide, video tutorials, documentation, training sessions and other ressources for Anatella.Support center
Download Anatella with the “Timi Community Edition” or the “Business Edition trial”. Free to use. No limit, no constraint.Download Anatella
DATA MANAGEMENT MADE EASY
A platform built for Citizen Data Scientists
Forget complex, costly, lengthy and code intensive data management procedure.
Thanks to Anatella, the business-oriented users can easily collaborate
with the most technical users to produce outstanding value.
Never wait anymore for a data transformation. Anatella handles tables with billions of rows and thousands of columns even on small infrastructures.
Develop complex data transformations faster using an intuitive interface that requires no code and save your data transformations for easy re-use.
Easily extract, clean, aggregate and join all kinds of datasets and inject them into a RDBMS, a BI tool, a modeling tool or into R/Python.
Fast data management
This typically includes: Extract different Datasets from various storage types and various locations ; Cleaning and Validating your Dataset; Compute many different aggregates on your datasets ; Join several datasets together (you know: those two databases that were never supposed to be joined together because they lack a proper common key?) ; Injecting your datasets into a RDBMS, into a BI tool, into a modeling tool, into R/Python.
With Anatella, you can easily perform any data quality and data cleaning tasks on large data volumes (handle tables with several billions rows using only one laptop).
R and Python integration
Anatella offers an abstraction layer around the code that allows analysts and coders to communicate smoothly and make progress in a collaborative way. Thanks to the abstraction layer, the analyst don’t see any line of code while the coder are still writing code. Both groups can still think together and bring meaningful contributions on how to solve the problem at hand.
Simple “Static” Reporting
Anatella is fully integrated with the MS-Office Suite. In a few mouse clicks, Anatella can read, write or update MS-Excel .xlsx files at very high speed (using its own proprietary code). Anatella allows you to automatically update all the charts and graphs of all your MS Office reports (without using any un-maintainable VBA scripts!). For example, Anatella can automatically update the charts in your Powerpoint report, for easy reporting to your C-Level.
BI integration and OLAP
Anatella is the ideal companion to any & all BI tools (such as: Tableau, Qlik, Kibella and Kibana). Anatella automatically generates and updates the datasets used inside your BI solution. Anatella can natively create at high speed the internal, proprietary file format used by Tableau, Qlik, Kibella and Kibana. It means that refreshing the data “behind” your dashboard has never been so easy.
Built for Machine Learning
Anatella includes natively strong Text Mining capabilities that can be even more extended thanks to the R/Python integration. Anatella also includes many functionalities tailored for predictive modeling such as feature engineering, meta-data-free transformations, graph-mining, etc.
Built For Iterative Work
BI, Analytics and Predictive Analytics projects are characterized by their “exploratory” nature. That’s why Anatella allows quick iteration over different variations of your data transformations.
Anatella is able to run SNA (Social Network Analytics) and Graph Mining algorithms on the very large graphs available inside the telecom & banking companies. Using Graph Mining algorithm, detect Communities, detect Social Leaders, compute Graph kernels, etc. on graphs with several dozens millions of nodes and several billions of arc.
Anatella not only solves the “data in motion” problem but it also solves the “data storage problem”. Thanks to two proprietary file formats (.gel_anatella and .cgel_anatella), Anatella can store vast amount of data without requiring much diskspace. For example, you can store a 5TB RDBMS data base in less than 100GB of hard drive space and still have incredible I/O performances while manipulating your datasets.
Furthermore, from within Anatella, the Hadoop HDFS drive is seen as a “normal”, local drive (e.g. you can read/write/browse your HDFS drive “as if” it was a local drive): It means that it’s as easy to store all your dataset files on a HDFS drive, as to store your files on a highly efficient NAS/SAN/RAID6/”C:” drive. An HDFS drive offers unlimited storage at a low price.
Thanks to the direct Hadoop Integration, creating a centralized Data Lake on the HDFS drive with Anatella is a breeze. All the tools inside the Hadoop ecosystem are reading and writing their data inside files inside the HDFS drive. The file format of the files saved on an HDFS drive are, typically: Text files, .parquet files, .avro files (and also .gel_anatella or .cgel_anatella files when using Anatella)
Anatella use low-level C code to read/write parquet files directly from/to a HDFS drive. This makes Anatella several times faster than any other tool inside the Hadoop ecosystem.
Anatella connects natively to any “legacy” data source (e.g. old SAS .sas7bdat file, old AS-400 Main Frame files created in Kobol, old DBase or FoxPro databases, etc.) and it also connects to the “new wave” of Big Data solution (HDFS) and IoT solutions (real-time). This means that Anatella is the perfect tool to integrate all these different (new and old) technologies.
For example, with Anatella, in a few mouse click, you can feed all your “HDFS datasets” inside your Data WareHouse (based on Oracle, Teradata, etc.) and the other way around.
Straightforward&Fast Industrialization of R&D findings
Once a data transformation is ready for industrialization, you can deploy it on your (pre-)production server/cluster in a few mouse-clicks.
Anatella scripts can be executed in command line, via REST API or via any scheduler such as Jenkins. It runs on both Windows and Linux servers.
IoT integration and Real-Time streaming
Anatella data transformations can work both in classical “batch mode”, but also in real-time streaming mode (although not all data transformation operators are available when working in streaming mode). This means that direct bi-directional connections to common IoT brokers such as Kafka, RabbitMQ, Mosqitto is straightforward and easy. Thanks to the load-balancing included in such brokers, Anatella can sustain a practically unlimited amount of simultaneous connections (just add more nodes if you need more speed).
No Cloud, No In-Memory
The selection of technologies used in Anatella is based on efficiency rather than popularity. This means that Anatella is not an “in-memory” solution. The dataset sizes that you can manipulate with Anatella are only limited by the size of your (local or HDFS) drive and not by your RAM. This limit is furthermore reduced thanks the usage of a highly-compressed proprietary file format available in Anatella (.gel_anatella and .cgel_anatella files) that allows to store Terabytes of data in a few dozen gigabytes.
Anatella doesn’t rely on a large cloud infrastructure to provide the necessary computing power to process these vast amount of data. Of course, you can still use Cloud/Distributed computation or storage with Anatella. Most of the time, Anatella’s engine is so efficient (both in terms of computation and storage) that one ordinary laptop is more than enough to handle all the tasks at hand.2 Examples of fast ETL on a laptop
Non intrusive & Easy deployment
You can install and run your data-transformation scripts on any Windows-based PC (or inside “Vine” on Linux).
Anatella has a small, automated & easy wizard-based installation system that installs Anatella in less than a minute. You can run your Anatella data-transformation Scripts on any PC, even with restricted system privileges.
There even exists a portable version of Anatella that requires no installation (i.e. it’s a simple ZIP file to unzip). The portable version guarantees that no bloatware will ever be installed on your machine when “installing” Anatella since there is no real installation: i.e. All you do is unzipping a ZIP file. The portable version is also very handy if you are working as an external consultant at your client site and want to quickly process some data. With the portable version of Anatella on a USB stick, you can run your Anatella data-transformation graphs on any PC, even with very restricted system privileges. There is no excuse to not use Anatella everywhere!
Low licensing cost
There are no licensing fee based on the volume of processed data (i.e. There is no “data tax”). Furthermore, the Community Edition of Anatella is totally free and covers more than 95% of the usual business cases.
You only pay for additional functionalities, maintenance and support. Not for volume. This type of licensing model is important for growing businesses, with an increasing amount of data. If your business grows, with Anatella, you can still process your data as often as you need.
“The optimal solution to extract advanced Social Network Algorithms metrics out of gigantic social data graphs.”
“We reduced by 10% the churn on the customer-segment with the highest churn rate.”
“TIMi framework includes a very flexible ETL tool that swiftly handles terabyte-size datasets on an ordinary desktop computer.”