Dealing big-data from HBase with Spark, using Java.
In hbase, records are kept in the following format:
| RowKey | Info:carID | Info:name | Info:time |
|---|---|---|---|
| 0001 | 5556 | Honda | 0214 |
| 0002 | 5557 | Toyota | 0311 |
| 0003 | 5556 | Honda | 0215 |
| 0004 | 5558 | Buick | 0412 |
Feb. 15: After mapping and combineByKey, we get several lists goes that:
| 5556: 0214, 0215 |
|---|
| 5557: 0311 |
| 5558: 0412 |