Map Once, Deploy Anywhere

 


To Hadoop or Not. The Choice is Easier Than You Think.

To manipulate big data, most people think they need a new IT fabric like Hadoop or Teradata, an in-memory or columnar database like SAP HANA or Vertica, a DB or ELT appliance like Exadata or Netezza, or a complex ETL tool like Informatica or Ab Initio. Do you have the time, money and expertise for them?

What if there were a simpler, more affordable platform for fast big data processing and governance, one that used your existing file system and HDFS data, and their processing engines, interchangeably?


There is. Whether your data sources are in a standard Unix, Linux or Windows file system, in HDFS, or managed in the proprietary systems above, you can process that data in the IRI Voracity platform using either the long-proven IRI CoSort engine or Hadoop engines interchangeably. Without coding or changing anything, your jobs share the same simple, accessible metadata layer. You manage that metadata in a free Eclipse IDE called IRI Workbench, which provides graphical job design and execution modes.

How do you want to multi-process big data workloads?

These are the seamless processing choices that only IRI Voracity delivers for big data transformation, masking, and generation:

Without Hadoop

IRI CoSort jobs running alone or inside Voracity platform projects give you a 40-year proven alternative to Hadoop for fast, intuitive, inexpensive, and non-disruptive data manipulation. CoSort avoids the skills gap and support costs of Hadoop, and it does not require the time, money, or manpower that other big data systems do. It is a low-cost, low-impact, and low-risk option for small and medium-sized businesses, and for enterprise line-of-business teams that value its multi-terabyte processing performance.

With Hadoop

IRI Voracity leverages the performance, scalability, load balancing, and automatic failover capabilities of MapReduce 2 (MR2), Spark, Spark Streaming, Storm, and Tez. Voracity runs most CoSort (SortCL) data transformation and masking jobs in these engines based on availability and need. Voracity works in Cloudera, Hortonworks, and MapR distributions. IRI can also provide its own Hadoop distribution on-premises or in the cloud, and on request, in a hardware appliance that includes everything. This article shows how to run Voracity jobs in Hadoop.

With Voracity, it's no longer a matter of homogeneous data processed heterogeneously, or vice versa. It's about having a seamless, unified, metadata-driven enterprise information architecture ... one that gives you control over different data sources and processing engines ... and one that meets changing data integration, governance, and analytic needs.

Learn More

A Big Data Quandary


Big data volumes are growing exponentially, and simply throwing hardware at them isn't a complete or reliable long-term solution. IRI's proven strategies and software, however, are.


What is Hadoop?


Hadoop is an increasingly popular computing environment for distributed processing that businesses can use to analyze and store huge amounts of data.


When to Use Hadoop


Hadoop isn't a one-size-fits-all framework. You need to know when and how to use it. Voracity makes short work of Hadoop job design and deployment when you need it.


See Also

Embracing Velocity with Voracity: a DBTA article analyzing the speed of IRI Voracity.