Which Little Pig Chose Your Real-Time Inventory Technology?
As a trusted advisor to many enterprise architects and e-commerce business executives, I thought I would share my experience with a trending technology evaluation across the industry’s biggest brands...
A convenient workload generator for Couchbase in OpenShift
Our first Guest Post from the Community Writing Program comes from Nicolas Motte. Nico is a full-stack engineer in the South of France. He released several native and hybrid mobile applications to...
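How I Built the Prize-Draw System for Our Company's Annual Party
The requirement appears. The annual party was approaching, and a prize draw is an essential part of it, but we had no prize-draw system yet. So one day the PM walked over and said: "Hey, you've finished the requirements on your plate, right? We need a prize-draw system built before the annual party starts. It's simple: the back end sets a total amount and the range of amounts each user can win; once the money has all been handed out, users see 'Sorry, you didn't win'; and the draw also needs a configurable activity time window." Requirements analysis...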
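The requirements in the excerpt above (a total budget, a per-user amount range, a "no prize" result once the money is gone, and an activity time window) can be sketched roughly as follows. This is not the original author's implementation: the `Lottery` class, its parameter names, the use of cents, the example dates, and the rule that a draw returns "no prize" once the remaining budget falls below the minimum amount are all assumptions made for illustration.

```python
import random
from datetime import datetime


class Lottery:
    """Hypothetical prize-draw sketch; amounts are integer cents to avoid float drift."""

    def __init__(self, total, low, high, start, end, rng=None):
        if not 0 < low <= high:
            raise ValueError("need 0 < low <= high")
        self.remaining = total            # budget still available
        self.low, self.high = low, high   # per-user amount range
        self.start, self.end = start, end  # activity time window
        self.rng = rng or random.Random()

    def draw(self, now):
        """Return the amount won, 0 for 'sorry, no prize', or None outside the window."""
        if not (self.start <= now <= self.end):
            return None
        # Assumption: once the budget cannot cover the minimum prize, nobody wins.
        if self.remaining < self.low:
            return 0
        amount = self.rng.randint(self.low, min(self.high, self.remaining))
        self.remaining -= amount
        return amount


# Example values (hypothetical): 100.00 budget, prizes between 10.00 and 50.00.
start, end = datetime(2017, 1, 20, 19, 0), datetime(2017, 1, 20, 21, 0)
lottery = Lottery(total=100_00, low=10_00, high=50_00, start=start, end=end,
                  rng=random.Random(0))
results = [lottery.draw(datetime(2017, 1, 20, 20, 0)) for _ in range(11)]
```

Because every winning draw hands out at least the minimum amount, the budget in this example is exhausted within ten draws, so the eleventh draw is always a "no prize". The sketch is single-threaded; a real system serving concurrent users would need to make the check-and-deduct step atomic.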
Hortonworks 2016 Year in Review
As we kick off the new year I wanted to thank our customers, partners, Apache community members, and of course the amazing Hortonworks team, for an amazing 2016. Let’s take a step back and look at some...
Telemetry meets HBase (again)
At the end of December AWS announced that HBase on EMR supported S3 as data store. That’s great news because it means one doesn’t have to keep around an HDFS cluster with 3x replication, which is not...
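What Is MongoDB: Features, History, Downloads, and Tools
What is MongoDB? MongoDB is written in C++ and is an open-source database system based on distributed file storage. Under high load, adding more nodes keeps server performance up. MongoDB aims to provide a scalable, high-performance data storage solution for web applications. MongoDB stores data as documents whose structure consists of key-value (key=>value) pairs. MongoDB documents are similar to JSON...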
Zero DBAs, Zero Ops: A 30-Person Engineering Team Can Still Take On a 100-Person Hadoop-Based Team
For more great content, see the Yunqi Community big data channel: https://yq.aliyun.com/big-data...
Processing Image Documents on MapR at Scale
There has been a lot of research in document image processing over the past 20 years, but not much research has been done in terms of parallel processing. Some of the solutions proposed for parallel...
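A Study of the Correlation Between Forest Air Temperature and Ground Temperature on the Hadoop Platform
A Study of the Correlation Between Forest Air Temperature and Ground Temperature on the Hadoop Platform. 杨博文, 汪子炎, 荀文婧, 刘晓峰, 朱正礼. Based on big data on air temperature and on soil temperature 5 cm below the surface in the forest of the Zijin Mountain area of Nanjing, this paper improves on traditional data analysis methods and proposes using cloud computing to analyze forestry Internet of Things data. Using the MapReduce big data processing framework of the Apache Hadoop cloud computing platform, it studies the relationship between air temperature and soil temperature 5 cm below the surface. This paper uses...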
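Apache Foundation Announces Apache Eagle as a Top-Level Project
On January 10, 2017, the Apache Software Foundation (ASF), made up of more than 350 open source projects and initiatives and run entirely by volunteer developers, stewards, and incubators, announced that Apache Eagle has graduated from the Apache Incubator to become a Top-Level Project (TLP). This signifies that the project's community and products have been operating well under the ASF's meritocratic process and principles. Apache...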
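Reflections on the MongoDB Massacre: More Than 33,000 Databases Breached and Held for Ransom
Few people expected that a seemingly trivial incident last December would turn into a massacre at the start of the new year. Now the victims, through their own negligence and slow reaction, seem increasingly powerless to resist and are falling one after another. As of this Wednesday (January 11), more than 20 hackers have joined this one-sided rout of MongoDB users, the number of databases breached and held for ransom has passed 33,000, and the figure is still rising. (According to Capgemini's Niall...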
Incremental consistency guarantees for replicated objects
Incremental consistency guarantees for replicated objects Guerraoui et al., OSDI 2016 We know that there’s a price to be paid for strong consistency in terms of higher latencies and reduced...
Have You Thanked Your Data Steward Today?
The other day I Googled, “the problem with a modern data architecture.” Of course, at Attivio we’re big evangelists for an MDA, but it’s always interesting to see what the contrarians have to say....
Rack Awareness in Hadoop HDFS
1. Introduction This tutorial will help you understand the Hadoop rack awareness concept, racks in a Hadoop environment, why rack awareness is needed, replica placement policy in Hadoop via Rack...
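MongoDB Database Ransom Attacks: The Number of Victims in China Is Beyond What You'd Imagine. SOS!
Today, while scrolling through WeChat Moments, a 雷锋网 editor saw Tencent security expert 召唤 mention that multiple ransom attacks against MongoDB and ElasticSearch have already appeared in China. What? The MongoDB ransom attacks that have recently been raging abroad have reached China?! 雷锋网 immediately contacted 召唤 and learned that a single Chinese security company had recently detected 4 ransom cases targeting MongoDB and ElasticSearch instances in China. The real number of victims, however, is certainly higher. Open, with no authentication required...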
How-to: Fuzzy Name Indexing in Apache Hadoop with Rosette and Cloudera Search
In this guide, learn how to use Cloudera Search with Basis Technology’s Rosette to perform fuzzy name searches in multiple languages and scripts. Our thanks to the Basis Technology team (Jeanne Le Garrec,...
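[HBase] Backing Up Tables with CopyTable
Unless otherwise noted, all posts on this blog are original! Please credit the source when reposting: Big data enthusiast ( http://www.lubinsu.com/ ) Permalink: [HBase] Backing Up Tables with CopyTable ( http://www.lubinsu.com/hbase-copytable/ ) CopyTable usage: before running the command, the table must first be created...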
Deploying Pull Requests with Docker
The Git repositories in my current project are hosted on Bitbucket Cloud. Any code changes have to go through pull requests. Jenkins builds the pull requests and gives its approval if the...
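Advanced Redis 5: Pipelining Notes (tags: database, database study, advanced Redis, pipelining notes)
Advanced Redis 5, pipelining notes: 1. The client and Redis communicate over a TCP connection. 2. When executing multiple commands, each command has to wait for the previous one to finish. 3. Redis's underlying communication protocol supports pipelining. With a pipeline, multiple commands can be sent in one go, and their results are returned together after they have executed.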
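To make the pipelining note concrete, here is a minimal sketch that needs no Redis server. It encodes commands in the Redis wire format (RESP arrays of bulk strings) and compares sending them one at a time with sending them as a single pipelined buffer. `CountingConnection` is a hypothetical stand-in for a TCP socket that only counts writes, and the helper names are illustrative, not part of any Redis client library.

```python
def encode_command(*args: str) -> bytes:
    """Encode one command as a RESP array of bulk strings."""
    parts = [f"*{len(args)}\r\n".encode()]
    for arg in args:
        data = arg.encode()
        parts.append(b"$" + str(len(data)).encode() + b"\r\n" + data + b"\r\n")
    return b"".join(parts)


def build_pipeline(commands) -> bytes:
    """Concatenate several encoded commands into one buffer for a single send."""
    return b"".join(encode_command(*cmd) for cmd in commands)


class CountingConnection:
    """Hypothetical stand-in for a TCP socket; it records and counts writes."""

    def __init__(self):
        self.writes = 0
        self.sent = b""

    def send(self, payload: bytes) -> None:
        self.writes += 1
        self.sent += payload


commands = [("SET", "k", "1"), ("INCR", "k"), ("GET", "k")]

# Without pipelining: one write per command (a real client would also
# wait for each reply before sending the next command).
one_by_one = CountingConnection()
for cmd in commands:
    one_by_one.send(encode_command(*cmd))

# With pipelining: all commands go out in a single write, and the
# replies would be read back together afterwards.
pipelined = CountingConnection()
pipelined.send(build_pipeline(commands))
```

The bytes on the wire are identical in both cases; only the number of writes, and in a real client the number of network round trips, changes.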
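[Database] Clustered and Nonclustered Indexes (tags: database, database study, clustered index, nonclustered index)
[Database] Clustered and nonclustered indexes. A clustered index is a special kind of index that stores a table's data in the sort order of the index; in effect, it reorganizes the data in the table. Clustered indexes are especially useful when data is queried by ranges of values. When large amounts of data are being modified, a clustered index is no longer a good fit. Guidelines for creating a clustered index: 1. Most tables should have a clustered index or use partitioning to reduce contention on the table's last page; in a high-transaction environment, locking on the last page severely hurts system throughput....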