Given the buzz around Spark Streaming and Storm, they can seem like
obvious choices for supporting streaming analytics. However, most of
our customers have struggled to take either Spark Streaming or Storm
beyond the proof-of-concept stage, because both address enterprise
objectives too narrowly to offer a complete solution. Enterprises
require an easy-to-use, visual, tools-based approach that works out of
the box. The platform needs to meet the needs of data scientists,
developers and data center operations teams without an extensive and
expensive patchwork of custom code and third-party software that
often fails.
DataTorrent RTS is the industry's first fully Hadoop-native streaming
analytics solution. It provides an enterprise-grade streaming analytics
platform, tools and pre-built analytics modules, and lights-out data
center operational capabilities.
www.datatorrent.com
What to ask
To ensure an enterprise-grade solution that meets your organization's SLA
requirements, ask the following questions of your proposed solution:
If Hadoop is your core big data platform, does your streaming platform
seamlessly use HDFS for raw data, application state checkpoints and engine
state management, reducing dependence on external datastores such as
relational databases that do not scale? Also, does your streaming platform
run natively on YARN for scheduling, so that you do not have to make the
platform's own scheduler cooperate with YARN, which can
cause significant multi-tenancy and operational issues?
Can the streaming analytics solution auto-scale and process increased data
loads without manual programming and re-deployment?
Does the streaming analytics platform guarantee the processing order of
your events under all processing guarantees (at-most-once, at-least-once and
exactly-once) without having to micro-batch the input data?
Is the streaming analytics solution's fault tolerance complete (raw events,
application state and engine state), abstracted from the developer and done
natively in Hadoop using HDFS?
Streaming analytics applications need to be able to handle events non-stop.
Does your streaming analytics solution support dynamic updates to
application properties and business logic with no application downtime?
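The processing-guarantee and fault-tolerance questions above can be made concrete with a small sketch. The Python below is purely illustrative and is not how DataTorrent RTS, Spark Streaming or Storm implement recovery. It assumes a hypothetical operator (`CountingOperator`) that checkpoints its state together with the offset of the last event applied, and a source that replays events after a crash (at-least-once delivery). Skipping replayed offsets that are already reflected in the restored state gives an exactly-once effect on the counts.

```python
from dataclasses import dataclass, field


@dataclass
class CountingOperator:
    """Hypothetical stateful operator that counts events per key."""
    counts: dict = field(default_factory=dict)
    last_offset: int = -1  # offset of the last event applied to the state

    def process(self, offset, key):
        # Ignore replayed events that the state already reflects.
        if offset <= self.last_offset:
            return
        self.counts[key] = self.counts.get(key, 0) + 1
        self.last_offset = offset

    def checkpoint(self):
        # Snapshot state and input position together so they stay consistent.
        return {"counts": dict(self.counts), "last_offset": self.last_offset}

    @classmethod
    def restore(cls, snapshot):
        return cls(counts=dict(snapshot["counts"]),
                   last_offset=snapshot["last_offset"])


def run_with_failure(events, checkpoint_every, fail_at):
    """Process events, crash once before index fail_at, then recover."""
    op = CountingOperator()
    snapshot = op.checkpoint()
    for offset, key in enumerate(events):
        if offset == fail_at:
            # Simulated crash: in-memory state is lost, so restore the
            # last snapshot and let the source replay from the beginning.
            op = CountingOperator.restore(snapshot)
            for replayed in range(offset):
                op.process(replayed, events[replayed])
        op.process(offset, key)
        if (offset + 1) % checkpoint_every == 0:
            snapshot = op.checkpoint()
    return op.counts


events = ["a", "b", "a", "c", "a", "b"]
print(run_with_failure(events, checkpoint_every=2, fail_at=3))
```

In this sketch the snapshot lives in memory. A production platform would persist it to durable storage such as HDFS, which is the point of the fault-tolerance question above.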
What to ask
To ensure that data scientists and developers can rapidly assemble applications, ask
the following questions of your proposed solution:
Does the streaming analytics solution have connectors that support
fault-tolerant, auto-scaling data ingestion and distribution for all of your
data sources and analytics destinations out of the box?
Are common data analytics capabilities such as joins, aggregations and
statistical analysis available out of the box? How about complex capabilities
such as dimensional cube creation and integration with machine learning
tools?
Does the solution automatically aggregate data over varying windows, both
static and rolling, or does the developer have to implement this manually?
Is the solution friendly to data scientists and business analysts, with visual
application creation and data visualization tools?
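To illustrate what "implementing windows manually" involves, here is a minimal Python sketch of the two window types mentioned above. It is an assumption-laden illustration, not any vendor's API: the function names `tumbling_sums` and `sliding_sums` are hypothetical, events are `(timestamp, value)` pairs, and the input is assumed to arrive sorted by timestamp. A static (tumbling) window assigns each event to exactly one fixed bucket, while a rolling (sliding) window keeps a running total over the most recent `width` time units.

```python
from collections import defaultdict, deque


def tumbling_sums(events, width):
    """Sum values per fixed, non-overlapping window of `width` time units."""
    sums = defaultdict(int)
    for ts, value in events:
        window_start = (ts // width) * width  # bucket this event belongs to
        sums[window_start] += value
    return dict(sums)


def sliding_sums(events, width):
    """For each event, sum the values with timestamp in (ts - width, ts]."""
    window = deque()
    total = 0
    results = []
    for ts, value in events:  # assumes events are sorted by timestamp
        window.append((ts, value))
        total += value
        # Evict events that have fallen out of the rolling window.
        while window[0][0] <= ts - width:
            _, old_value = window.popleft()
            total -= old_value
        results.append((ts, total))
    return results


events = [(0, 1), (3, 2), (5, 4), (11, 3), (12, 5)]
print(tumbling_sums(events, 10))
print(sliding_sums(events, 10))
```

Even this toy version ignores out-of-order events, late data and state recovery, which is why the question of whether the platform handles windowing automatically matters.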
What to ask
Does your organization require easy-to-use tools for the full application
deployment and management operations cycle?
Are visual, automated alerting and command line tools required for your
data center operations team?
Does the streaming analytics solution have built-in capabilities to make
application modifications dynamically?
Conclusion
Enterprises are seeing greater opportunity to better serve their customers, drive
greater revenues and reduce costs through operational efficiencies. In order to
capitalize on the opportunity, organizations are looking for solutions that enable rapid
insights and action to be taken on fast big data. An enterprise-grade solution is
required that meets the needs of data scientists, developers and data center
operations.
The top 3 reasons that enterprises are deploying DataTorrent RTS over Spark
Streaming are summarized below.
Enterprise-grade streaming analytics platform
Industry's first Hadoop-native, fully multi-tenant, YARN- and HDFS-based
architecture
No data loss with automatic fault tolerance for raw event data, application
state & engine state
Visual application creation tool that utilizes the 450+ open source Java
operators
Ability to ingest data from and distribute to any source with more than 75
pre-built adaptors
Additional Resources
DataTorrent RTS: Data sheet
DataTorrent RTS Whitepaper
DataTorrent download
DataTorrent Inc.,
3200 Patrick Henry Drive
2nd Floor
Santa Clara CA 95054
+(1) 408-331-5034, ext #101
www.datatorrent.com