#bytescrolls
Apr 27, 2017
Run Datastax Graph in a docker container for windows hosts
#docker #datastax There are some Docker images available here. If you don't have Docker, download and install Docker Toolbox from her...
Apr 15, 2016
A simple recommender system for your e-commerce store using a graph database
#graph #recommendation #orientdb #e-commerce #etl In the last post, I have introduced you to a simple ETL use case for graph database lik...
1 comment:
Apr 12, 2016
Orient DB - A simple ETL use case note
#orientdb #graph #etl #java #database Someone who is familiar with graph data structures would like to know how we can map real-world ...
3 comments:
Dec 4, 2015
Analytics by SQL and Spark using Apache Zeppelin
#spark #hadoop #analytics #apache #zeppelin #scala I was looking for a cool dashboard-based query interface for analytics. I stumble...
2 comments:
Sep 10, 2015
Json parsing, Scala way
Most Java developers are familiar with JSON parsing and object mapping using the Jackson library's object mapper functionality that en...
2 comments:
May 5, 2013
Simple metastore creation for Hive in MySQL
For Hive, the meta-store is like the system catalog which contains metadata about the tables stored in Hive. This metadata is specified duri...
5 comments:
Dec 19, 2012
Data and Brain
#bigdata Came across an interesting presentation on Using Data to Understand Brain. Using Data to Understand the Brain from jakeho...
Dec 18, 2012
Eventual Consistency
#distributed #nosql
Dec 17, 2012
Unicode features in various languages
Here’s what each language natively supports in its standard distribution. Unicode JavaScript ᴘʜᴘ Go Ruby Python ☕ Java...
Aug 1, 2012
Machine generated data
At first, the term "machine-generated data" can be confusing. One would think, every data is (or are?) generated from one de...
Jul 23, 2012
Nodeable - Realtime Insights
#Nodeable is a good example of generating #insights from #bigdata or the real time trickle feeds. It uses Twitter's Storm for the pro...
Apr 20, 2012
Hadoop meetup @inmobi Bangalore
Had a chance to attend the #hadoop #meetup today at #Inmobi Bangalore. Arun Murthy and Suresh Srinivasan from Hortonworks made presen...
Mar 9, 2012
Creating index in Hive
Simple: CREATE INDEX idx ON TABLE tbl(col_name) AS 'Index_Handler_QClass_Name' IN TABLE tbl_idx; As to make pluggable indexin...
13 comments:
Mar 6, 2012
What's it about Cascading?
Cascading helps manipulate data in Hadoop. It is a framework written in Java that abstracts MapReduce and allows you to write script...
Mar 1, 2012
O Blimey! TED Talk 2023
Prometheus film, going viral... like a fire that danced at the end of the match. Aha! cybernetic life-forms... The only "pur...
Feb 29, 2012
Tinker, Tailor, Soldier, Spy and The Perspicacious "Collusion"
Collusion! A secret agreement between two or more parties for a fraudulent, illegal, or deceitful purpose. In this battleground of p...
Feb 28, 2012
The mythical unstructured data!
As semantic web and big data integration gain their fus-ro-dah, enterprises are finding a way to harness any available form of information...
Feb 27, 2012
Consistent Hashing
What is a consistent hash function? A consistent hash function is one which changes minimally as the range of the function changes. What...
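The definition in this teaser can be illustrated with a small hash ring. This is only a sketch under stated assumptions, not the code from the post: the `HashRing` class, the use of MD5, and the `replicas` count of 100 virtual points per node are all illustrative choices.

```python
import bisect
import hashlib


def _hash(value: str) -> int:
    """Map a string to a large integer position on the ring (MD5 is an arbitrary choice)."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


class HashRing:
    """Hypothetical minimal consistent-hash ring with virtual nodes."""

    def __init__(self, nodes=(), replicas=100):
        self.replicas = replicas  # virtual points per node (assumed value)
        self._points = []         # sorted ring positions
        self._owner = {}          # ring position -> node name
        for node in nodes:
            self.add_node(node)

    def add_node(self, node: str) -> None:
        # Each node is placed at several positions to even out the load.
        for i in range(self.replicas):
            point = _hash(f"{node}#{i}")
            self._owner[point] = node
            bisect.insort(self._points, point)

    def node_for(self, key: str) -> str:
        if not self._points:
            raise ValueError("ring has no nodes")
        # The first point clockwise from the key's hash owns the key.
        idx = bisect.bisect(self._points, _hash(key)) % len(self._points)
        return self._owner[self._points[idx]]


keys = [f"key-{i}" for i in range(1000)]
ring = HashRing(["A", "B", "C"])
before = {k: ring.node_for(k) for k in keys}

ring.add_node("D")  # the "range" of the function grows by one node
after = {k: ring.node_for(k) for k in keys}
moved = [k for k in keys if before[k] != after[k]]
```

Only the keys that now fall to the new node `D` change owner; every other key keeps its previous assignment, which is the "changes minimally" property.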
Feb 26, 2012
About Bulk Synchronous Parallel (BSP) model
As an alternative to the MapReduce paradigm, there is another parallel computing model called Bulk Synchronous Parallel (BSP). A BSP computer is...
Jul 15, 2011
Boyer and Moore's Linear Time Voting Algorithm
This is a simple linear time voting algorithm designed by Robert S Boyer and J Strother Moore in 1980 which is discussed in their paper MJRTY...
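As a rough sketch of the voting idea (the function names `majority_candidate` and `majority_element` are illustrative, not taken from the post or the paper): one pass keeps a candidate and a counter, and a second pass confirms whether the candidate really holds a strict majority.

```python
def majority_candidate(items):
    """Single pass: pair off differing elements; the survivor is the only possible majority."""
    candidate = None
    count = 0
    for item in items:
        if count == 0:
            # No unpaired element left, so start backing the current one.
            candidate = item
            count = 1
        elif item == candidate:
            count += 1
        else:
            count -= 1
    return candidate


def majority_element(items):
    """Return the element occurring more than half the time, or None if there is none."""
    items = list(items)
    candidate = majority_candidate(items)
    # The first pass only guarantees a candidate, so verify it with a second pass.
    if items and items.count(candidate) > len(items) // 2:
        return candidate
    return None
```

Both passes are linear in the input length, and the first pass needs only constant extra space.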
Jul 14, 2011
Descending Iterator and Adapter pattern
There is a descending iterator in the linked list implementation in the Java SDK. A humble private class in LinkedList. A good example of adapter. ...
May 30, 2011
Using Avro to serialize logs in log4j
I have written about the serialization mechanism of Protocol Buffers previously. Similarly, Apache Avro provides a better serialization framew...
May 23, 2011
Bloom Filters
A Bloom filter is a probabilistic data structure. This can be used to store a set of data in a space-efficient manner. For example, a distribute...
3 comments:
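To make the Bloom filter teaser concrete, here is a minimal sketch. The `BloomFilter` class, its default sizes, and the salted SHA-256 hashing scheme are assumptions chosen for illustration and are not from the post.

```python
import hashlib


class BloomFilter:
    """Hypothetical minimal Bloom filter over strings, backed by a bit array."""

    def __init__(self, size_bits=1024, num_hashes=3):
        self.size_bits = size_bits      # number of bits in the filter (assumed default)
        self.num_hashes = num_hashes    # hash positions per item (assumed default)
        self._bits = bytearray((size_bits + 7) // 8)

    def _positions(self, item: str):
        # Derive several bit positions by salting one hash function with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.size_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self._bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, item: str) -> bool:
        # False means definitely absent; True means possibly present.
        return all(
            self._bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


seen = BloomFilter()
seen.add("apple")
seen.add("banana")
```

A lookup never reports an added item as absent, but it can occasionally report an item that was never added as present. That false-positive rate is the price paid for the space saving.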
May 9, 2011
Labs
avro-log4j - serialization mechanism to provide a layout for log4j firetester - A simple RESTful services testing tool written in ...
Apr 13, 2011
Interesting uses of sun.misc.Unsafe
Inspired by a question I found on Stack Overflow, I started looking up the uses. I found some pretty interesting ones... VM ...
1 comment:
Apr 12, 2011
Accessing data from storage system using XAM API
You may be familiar with POSIX, a collection of standards that enables portability of applications across OS platforms. It provi...