Hadoop news

Hadoop


Top Stories

In this  article  I would  like to look at a few tools which are overlooked when it comes to Big Data analytics. Organizations that  have  already  heavy investment  on Mainframe  and  would like to continue  with the utilization of Mainframe can consider these  tools for further  expanding their Big Data Analytics reach. DFSORT-  Sorting & Merging Large Data Sets : Much before RDBMS have taken their place, Cobol programs have 2 major file manipulation operations namely: SORT operation accepts un-sequenced input and produces output in specified sequence The Merge operation compares records from two or more files and combines them in order DFSORT adds the ability to do faster and easier sorting, merging, copying, reporting and analysis of your business information, as well as versatile data handling at the record, fixed position/length or variable position/length fi... (more)

In 2014 Big Data Investments Will Account for Nearly $30 Billion - Eventually Accounting for $76 Billion by 2020 End

DALLAS, Aug. 21, 2014 /PRNewswire-iReach/ -- Amid the proliferation of real time data from sources such as mobile devices, web, social media, sensors, log files and transactional applications, Big Data has found a host of vertical market applications, ranging from fraud detection to R&D. Photo - http://photos.prnewswire.com/prnh/20140821/138541 "Big Data Market: 2014 – 2020 – Opportunities, Challenges, Strategies, Industry Verticals & Forecasts" Key Findings: In 2014 Big Data vendors will pocket nearly $30 Billion from hardware, software and professional services revenues Big Data investments are further expected to grow at a CAGR of nearly 17% over the next 6 years, eventually accounting for $76 Billion by the end of 2020 The market is ripe for acquisitions of pure-play Big Data startups, as competition heats up between IT incumbents Nearly every large scale IT ven... (more)

Big Data Analytics: Thinking Outside of Hadoop

Big Data Predictions In the recent release of '2012 Hype Cycle Of Emerging Technologies,' research analyst Gartner evaluated several technologies to come up with a list of technologies that will dominate the future . "Big Data" related technologies form a significant portion of the list, in particular the following technologies revolve around the concept and usage of Big Data. Social Analytics: This analytics allow marketers to identify sentiment and identify trends in order to accommodate the customer better. Activity Streams: Activity Streams are the future of enterprise collaboration, uniting people, data, and applications in real-time in a central, accessible, virtual interface. Think of a company social network where every employee, system, and business process exchanged up-to-the-minute information about their activities and outcomes Natural Language Question A... (more)

Beyond Big Data and @Benioff’s “AI Spring” to the Dawn of Dataware By @MattMcIlwain | @CloudExpo [#BigData]

This article was authored by Matt McIlwain and was originally published on Medium. For more of Matt's writing, you can follow him here! Guest Post: Beyond Big Data and Benioff’s “AI Spring” to the Dawn of Dataware Big Data, AI, Machine Learning, Hadoop, Predictive Analytics — we hear these terms every day from companies such as Cloudera, Trifacta and Dato (formerly GraphLab) that are securing many millions in financing. I believe that 2015 will be the year when the conversation moves from Big Data to the Dataware stack. Over the past twelve months we have seen a lot of companies across the big data spectrum emerge and while the language can be the same, there are clear product categories that have emerged which describe the market opportunity and future growth. This is the Dataware stack. Dataware is the combination of infrastructure, data intelligence systems tha... (more)

INOVVO Uses Big Data for Customer Satisfaction By @Dana_Gardner | @BigDataExpo #BigData

The next BriefingsDirect big-data case study discussion examines how INOVVO delivers impactful network analytical services for mobile operators to help them engender improved end-user loyalty. We'll see how advanced analytics, drawing on multiple data sources, enables INOVVO’s mobile carrier customers to provide mobile users with faster, more reliable, and relevant services. To learn more about how INOVVO uses big data to make major impacts on mobile services, please join me in welcoming Joseph Khalil, President and CEO of INOVVO in Reston, Virginia. The discussion is moderated by me, Dana Gardner, Principal Analyst at Interarbor Solutions. Here are some excerpts: Gardner: User experience and quality of service are so essential nowadays. What has been the challenge for you to gain an integrated and comprehensive view of subscribers and networks that they're on in or... (more)

Oracle Fills Another Gap in Its Big Data Offering

When we last left Oracle’s big data plans, there was definitely a missing piece. Oracle’s Big Data Appliance as initially disclosed at last fall’s OpenWorld was a vague plan that appeared to be positioned primarily as an appliance that would accompany and feed data to Exadata. Oracle did specify some utilities, such as an enterprise version of the open source R statistical processing program that was designed for multithreaded execution, plus a distribution of a NoSQL database based on Oracle’s BerkeleyDB as an alternative to Apache Hive. But the emphasis appeared to be extraction and transformation of data for Exadata via Oracle’s own utilities that were optimized for its platform. With Oracle’s announcement of general availability of the big data appliance, it is filling in the blanks. As such, Oracle’s plan for Hadoop was competition, not for Cloudera (or Horto... (more)

Big Moves in Big Data: EMC's Hadoop Strategy

To date, Big Storage has been locked out of Big Data. It’s been all about direct attached storage for several reasons. First, Advanced SQL players have typically optimized architectures from data structure (using columnar), unique compression algorithms, and liberal usage of caching to juice response over hundreds of terabytes. For the NoSQL side, it’s been about cheap, cheap, cheap along the Internet data center model: have lots of commodity stuff and scale it out. Hadoop was engineered exactly for such an architecture; rather than speed, it was optimized for sheer linear scale. Over the past year, most of the major platform players have planted their table stakes with Hadoop. Not surprisingly, IT household names are seeking to somehow tame Hadoop and make it safe for the enterprise. Up ' til now, anybody with armies of the best software engineers that Internet fir... (more)

Is There a Place for Big Data in SMB Markets?

Is Big Data destined for only the top 3,000 companies worldwide? What about medium or small companies who are equally as data-driven? Is there a place for Big Data in SMB markets? When I talk to SMB companies about their use of public cloud services, it’s a no-brainer. Pay as you go, lower costs upfront, quick time-to-market. With private and public cloud solutions, both big and small companies benefit. So what about big data platforms? Is there an equivalent opportunity, or is Big Data only suited for Big Companies with Big Problems? I think Big Data applies to all, and here’s why. OLTP & DSS Era When I first started working on Teradata’s next-generation switch fabric, the BYNET, back in the early 90s, Teradata was around $250M in revenue (now over $2B). The concept of decision support systems (DSS) evolved into data warehousing and grew to become the largest pool ... (more)

Jaspersoft Delivers Big Data Integration into Jaspersoft ETL

LAS VEGAS, Feb. 16, 2012 /PRNewswire/ -- TDWI 2012 Conference -- Jaspersoft, maker of the world's most widely used business intelligence (BI) software, today announced an upgraded original equipment manufacturing (OEM) agreement with Talend to include native connectors to Apache Hadoop Big Data environments in Jaspersoft ETL. With this enhanced ETL offering, Jaspersoft uniquely offers CIOs, data scientists, and BI Builders the flexibility of three options to harness Big Data: direct reporting, direct real-time analysis, and batch analysis through ETL data mart access. Big Data sets can traditionally take data scientists hours to extract, transform, and load. Jaspersoft ETL significantly shortens the time it takes to connect to Big Data environments, eliminating the latency of other solutions and allowing insights to be uncovered faster than ever before. Jaspersoft ... (more)

Enabling Next Generation IO Architecture for Big Data at Cloud Expo NY

Cloud Expo New York $500 Savings here! More and more enterprises are deploying cloud computing and IT departments are requiring new capabilities with cloud storage infrastructure. Hadoop and Map Reduce have been the popular applications for handling distributed unstructured data. In his session at the 10th International Cloud Expo, Ilker Cebeli, Director of Cloud Computing at QLogic, will explain how Big Data is already impacting storage and networking architecture in the data centers and how to build optimized storage and networking infrastructure for the future. He will also explain how to leverage cloud storage while maintaining iSCSI and Fibre Channel local storage infrastructure. Explore Cloud Expo Sponsorship & Exhibit Opportunities ! Speaker Bio: Ilker Cebeli is Director of Cloud Computing at QLogic. He is responsible for leading and directing the cloud stra... (more)

Big Data Analytics: Datameer

Datameer provides a big data solution that focuses on perhaps the most important niche in this growing domain, the end-user. Here is an introduction from their website: Datameer leverages the scalability, flexibility and cost-effectiveness of Apache Hadoop to deliver an end-user focused analytics platform for big data. Datameer overcomes Hadoop’s complexity and lack of tools by providing analysts and technical users with pre-built functionality across data integration, analytics and data visualization of any volume and types of data. I’ve met with the CEO (Stefan Groschupf) and other Datameer executives. I’ve also interacted with them in events like our Government Big Data Forum. Through these events plus demonstrations by some of their greatest engineers has led me to a few conclusions about Datameer. In general, I believe enterprise technologists should take not... (more)