• submit to reddit
Chris Chang01/30/14
3559 views
0 replies

How Big is Your MongoDB?

As your MongoDB grows, information from db.stats() becomes helpful for evaluating hardware requirements. We frequently get questions about the dataSize, storageSize and fileSize metrics, so we want to help developers better understand how MongoDB storage works and what these metrics mean.

Rafał Kuć01/30/14
6167 views
0 replies

Apache Solr and Lucene 4.6.1

Today Apache Lucene and Solr PMC announced another version of Apache Lucene library and Apache Solr search server numbred 4.6.1. This is a minor bugfix release. However, it is a recommended update for those using SolrCloud.

Zac Gery01/30/14
6384 views
0 replies

Why Am I Developing This?!?

Let's set the scene: The project has been humming along, and a new request comes in for a minor change. It's completed in quick order, QA signs off, but the customers are frustrated and confused. To find clarity, stop focusing on what did happen and dig a little deeper into what didn't happen.

Alec Noller01/29/14
7719 views
0 replies

Dev of the Week: Alex Curylo

Every week here and in our newsletter, we feature a new developer/blogger from the DZone community to catch up and find out what he or she is working on now and what's coming next. This week we're talking to Alex Curylo, long-time Mac expert and contract iPhone programmer in Vancouver, BC, Canada.

Wojciech Kudla01/29/14
3757 views
0 replies

How to Serialize an Array of Doubles to a Byte

Low latency systems require high performance message processing and passing. The technique presented in this article is a good example of how leveraging some aspects of the data benefits encoding in both latency and space complexity.

Tim Eisenhauer01/29/14
2819 views
0 replies

Problems with Free and Open Source Collaboration Platforms

Recently, one of my government clients decided to switch their public facing ASP.NET to Drupal (an open source solution), and they had an embarrassing hacking scenario. There are a number of disadvantages to open source systems that you should be aware of before making the switch.

Ayende Rahien01/24/14
8603 views
0 replies

Big Data Search, Part 4: The Index Format is Horrible

The author has finished his own Big Data Search exercise, and it's interesting to see just how far out there the code is. It isn't something that you can use for much except as a basis to see how badly you are doing. Let's start with the index format: A CSV file with the value and the position in the original file.

Alec Noller01/22/14
6852 views
1 replies

Dev of the Week: James Sugrue

Every week here and in our newsletter, we feature a new developer/blogger from the DZone community to catch up and find out what he or she is working on now and what's coming next. This week we're talking to James Sugrue, principal front-end architect at Avego and DZone zone leader.

Vlad Mihalcea01/20/14
8331 views
0 replies

A Beginner’s Guide to MongoDB Performance Turbocharging

This is the second part of our MongoDB time series tutorial, and this post will be dedicated to performance tuning. In my previous post, I introduced you into our virtual project requirements. In short, we have 50M time events, spanning from the 1st of January 2012 to the 1st of January 2013.

Alec Noller01/19/14
8922 views
0 replies

The Best of the Week (Jan. 10): NoSQL Zone

Make sure you didn't miss anything with this list of the Best of the Week in the NoSQL Zone. This week's best include a comparison of various NoSQL databases, a discussion of the myths surrounding the open source movement, a demonstration of MongoDB's real-life usage, and more.

Alec Noller01/19/14
6363 views
0 replies

The Best of the Week (Jan. 10): Big Data Zone

Make sure you didn't miss anything with this list of the Best of the Week in the Big Data Zone. This week's best include a tutorial on getting started with ElasticSearch, Hadoop as the central tool of Big Brother, and a how-to for splitting large XML files in Java.

Julian Exenberger01/16/14
14587 views
0 replies

Builder Pattern Using Java 8

A key feature that Groovy builders have is their hierarchical approach to building constructs. This allows the builders to neatly and safely create nested, tree-like constructs which can be used to model everything from UX form layouts to XML. We can model this approach succinctly using Java 8 lambda expressions.

Krishna Prasad01/16/14
5647 views
0 replies

Sharding, Scaling, Data Storage Methodologies, and More: Insights on Big Data

In this article, the author provides a variety of insights on Big Data, including explanations and comparisons of OLTP and OLAP, data sharding, MPP, vertical and horizontal scaling, CAP Theorem, databases such as Greenplum and Hbase, and a detailed table comparing data storage methodologies.

Mark Needham01/16/14
4704 views
0 replies

Neo4j: Modeling ‘Series’ of Events

One of the things the author never worked out how to model in his football graph is a series of matches that allow him to answer questions like the following: How many goals has Robin Van Persie scored in his last 10 matches in the Barclays Premier League? In this article, the author explores some options.

Alec Noller01/15/14
7785 views
0 replies

Hadoop: The NSA-Fueled Privacy Invasion Machine

Hadoop users, or anybody interested in Big Data, may be interested in this recent article from Salon about the nefarious uses of Hadoop. A significant portion of the article is focused on explaining Hadoop, but then there's more: Hadoop as the central tool of Big Brother.