Key Takeaways from MongoDB World 2016
Last week I talked about our new Sumo Logic App for MongoDB and shared some of the interesting conversations I had with folks at MongoDB World. There were so many good sessions at the event, that I...
View ArticleMongoDB tips & tricks: Collection-level access control
As your database or project grows, you may be tasked with configuring access controls to allow different stakeholders access to the database. Rather than create a new user with full database...
View ArticleMongoDB World 2016参会全记录
上周和 @叶翔 一起参加了在 纽约举行的 MongoDB World 2016 ,MongoDB作为一个NewSQL数据库越来越受关注,这一点从 google趋势 、 百度指数 都可以看出来。...
View ArticleHadoop实验环境的搭建:standalone&pesudo cluster
无密登录 使用公钥-私钥的方式来进行避免密码输入和SSH认证。首先统一规定的hostname,按照 [2] 所指,修改3处hostname(注意root权限)。 # hostname Server # vi /etc/sysconfig/network //直接添加 Server # vi /etc/hosts //修改 127.0.0.1 的 hostname为 Server...
View ArticleHadoop Weekly Issue #177
04 July 2016 There were several announcements this week out of Hadoop Summit―HDP 2.5 preview, open-source software from Qubole and LinkedIn, and Mapr's Spyglass Initiative. In terms of technical...
View ArticleCompose's Little Bits 39 - CouchDB, MongoDB, Regrid, RediSearch, Minikube,...
CouchDB 2.0 gets its first release candidate, MongoDB gets 3.2.8 out, Regrid for RethinkDB gets an update, RediSearch shows the power of Redis modules, Minikube lets you do Kubernetes on your desktop,...
View ArticleBasho Open Sources Time Series Database Riak TS 1.3
Basho Technologies announced the open sourcing of Riak TS 1.3. Riak TS is specifically geared for handling time series data it supports fast write and query for time series data. In addition, Riak TS...
View Article分析比较多表查询中的IN与JOIN
IN 是子查询的关键字,JOIN 是连接的关键字,项目开发中经常会使用到多表查询,而子查询与连接正是实现多表查询的重要途径。那两者是怎么运行的?IN与JOIN哪个更好?下面就来分析与比较。 现在有test1与test2两张表,都没有任何像主键,外键那样的约束,且只有一个字段。两张表是非相关的。 现在使用IN关键字实现子查询,test2作为子查询表(外部表): 查看执行计划:...
View Article中国国际地区SQL表
SET FOREIGN_KEY_CHECKS=0;-- ------------------------------ Table structure for rc_district-- ----------------------------DROP TABLE IF EXISTS `rc_district`;CREATE TABLE `rc_district` ( `district_id`...
View Article基于Hadoop生态圈的数据仓库实践――进阶技术(三)
三、维度子集 有些需求不需要最细节的数据。例如更想要某个月而不是某天的记录。再比如相对于全部的销售数据,可能对某些特定状态的数据更感兴趣等。这些特定维度包含在从细节维度选择的行中,所以叫维度子集。维度子集比细节维度的数据少,因此更易使用,查询也更快。 本节中将准备两个特定维度,它们均取自现有的维度:月份维度(日期维度的子集),Pennsylvania州客户维度(客户维度的子集)。 1....
View Articlemongodb3.0分片及java代码连接操作测试(开启用户验证)
最近抽时间搭建了一下mongodb简单的分片,整个过程还算是蛮顺利,只不过在用户验证这一块遇到了一些问题,好在最后终于搞定。 一、服务器搭建过程: 1、安装四个mongodb:一个作为config、一个作为mongos、另外两个作为主要数据存储的服务器(机器ip192.168.0.201),如图:...
View Article使用sqluldr2实现将Oracle数据无落地快速导入Greenplum数据库
环境信息: Greenplum:Greenplum Version: 'postgres (Greenplum Database) 4.3.8.2 build 1' Oracle:Release 11.2.0.1.0 Production 1. Oracle客户端部署...
View Articlemongodb命令行group分组和java代码中group分组
group分组统计是数据库比较常用的功能,mongodb也不例外。不过相对于普通的增删改查,group操作就略微麻烦一些,这里对group在shell中的操作、使用java原生代码操作以及集成spring进行操作进行一个简单的归纳总结,分组的途径和方法应该都有多种,这里每一样只举一例。 本例中数据源如下: 需要使用group实现的目的是:按年龄分组统计出每一组的数量。 1、mongo...
View Article从 Trie 到 Double Array Trie
上周大伙讨论Double A rray Trie的时候,表示难以想象DAT的作者当初是怎么设计出这样的数据结构的。我当时分享了"事后诸葛亮"的猜测分析。稍作整理再闲聊一下,务虚,少细节。 Trie Trie是一种树状数据结构,常用于词典存储,也称前缀树。 看一个简单的例子,假设字符集只有三个字符{a, b, c}, 词典包含5个词{aaa, ac, b, c, cbc}。...
View ArticleAmbari服务依赖关系图生成脚本
1. 生成服务依赖关系 #!/usr/bin/python import sys import commands import json def genDependString(ip): url="curl --user admin:admin -H 'X-Requested-By:admin'...
View ArticleReal-time multi-tenant migration with Cloudant NoSQL database
Executing data migrations on a multi-tenant, document-based, enterprise-level cloud database system is as difficult to do as it is to say. NoSQL document-based storage has proven itself to be the best...
View ArticleHadoop Summit 2016: The Growth Accelerates
Hadoop Summit in San Jose wrapped up a few weeks ago. This was the ninth year and, wow, have we come a long way. It’s been a decade for Apache Hadoop and five years for Hortonworks. Hadoop Summit is...
View ArticleGetting Started with B2 CLI
After having installed the B2 CLI tool , we want to start trying it out by backing up a folder to the cloud. The first thing we want to do is create a bucket to sync to. A bucket is like a top level...
View ArticleELK+Redis 最佳实践
上个月老板让我搭一个日志分析系统,选用了 ELK 作为技术栈,网上的文章较为混乱,前期刚接触的时候查资料踩了很多坑,所以根据我总共一个星期的接触(包括安装和后期运维),大致总结一下我个人的最佳实践。 本文主要内容为: 架构逻辑 安装 配置 Docker 方案 这张图相信足以证明 ELK Stack 的强大之处了: 架构逻辑 E(lasticsearch) L (ogstash) K (ibana)...
View ArticleSpeed Tiles and Traffic-Influenced Routing
Valhalla, the open source routing engine which powers our Turn-by-Turn service, was designed to be extensible. Something we’ve wanted to add is traffic-influenced routing but how would this work? We’ve...
View Article