Organizations are increasingly opting for an analyze-then-store method for streaming data, according to a new survey by Swim.
Swim, the developer of a platform for continuous intelligence applications, found that almost half of all respondents to its quarterly streaming data survey analyze data in real-time, instead of storing it first.
It is an improvement on previous surveys and shows the industry is moving towards an analyze-then-store method, to avoid data staleness. The survey found that 16 percent of organizations perform contextual analysis in stream, lowering the time from streaming data entering the system to analysis even more.
SEE ALSO: Where is Streaming Data Analytics at Its Most Tangible?
Some data does not need to be analyzed in real-time. 32 percent of organizations store data in a warehouse or lake and process it later in batches, to gain contextual insights.
Continuous processing and streaming data has many key advantages. According to organizations surveyed, the most important advantage is improved customer experience (44%), followed by early warning of outage or threat (36%) and more efficient supply chain (36%).
Most use streaming technology for more than four purposes. The main ones are data pipelines (54%), messaging (54%) and microservices (52%). One third of organizations that responded said they use it for streaming applications.
21 percent of organizations are processing over one million messages per second, a staggering amount to process and analyze. Two percent of organizations are processing over 10 million messages a second.
70 percent of organizations build their own custom streaming data solutions. 50 percent build the full stack with available open source infrastructure and 20 percent operate on top of a pre-integrated commercial platform. The other 30 percent outsource to commercial platforms for all stages.
Apache Kafka is by far the most popular open source streaming technology, with 83 percent of organizations using it. Apache Pulsar is second place, with 17 percent usage. Apache Cassandra, Apache ActiveMQ and Apache Spark Streaming were also cited as popular frameworks.
Computer hardware, software and SaaS companies are leading the way in processing and analyzing streaming data, accounting for 20 percent of those surveyed. Retail/e-commerce and financial were joined second in adoption.
As the shift to more edge computing continues, we expect to see more adoption of analytical streaming capabilities and an effort to analyze in real-time or at source. Through this, organizations will have a continuous feedback loop, instead of weekly or monthly batch updates, leading to quicker and precise decisions.