Hadoop experiencing growing pains in mainstream businesses
Apache Big Data: Fast, unbridled growth has hurt adoption of Hadoop, according to a leading advocate of the technology. John Mertic, director of program management at ODPi, said that work on Hadoop was...
Spark issues
Problem description 1: In spark-shell, running sc.textFile("hdfs://test02.com:8020/tmp/w").count throws the following exception: java.lang.RuntimeException: Error in configuring object at...
"A Critique of the CAP Theorem"
This week I read a paper called A Critique of the CAP Theorem by Martin Kleppmann. I thought it was super interesting and I wanted to tell you why! And maybe convince you to read it. The CAP theorem...
DBA skills developers should know…
In many smaller companies or departments, there is no dedicated DBA. In these cases, it falls on *someone* in the office to come to grips with the blackbox that is the database server, keep it running,...
Too Busy to Create Your Own Visualizations? Just Leverage the Neo4j Console
At Vueling we have almost 700 applications using hundreds of databases, queues, FTP sites, web services, remote file shares, etc. Understanding how everything fits together is a lost battle without...
10 things you need to worry about in 2017
Each year, including last year, I’ve supplied you with “areas of concern”, that is, stuff that might not go well for you or our comrades in the coming 12 months. I’m happy to oblige once again this year...
The Top Ten Certification Exam Books of 2016
As you’re studying for that last certification of the year, it is totally reasonable if you determine that you need some additional material for studying. Have no fear! CBT Nuggets is here to help you...
Replication with Epsilon... or Not
I’m working on an events database called Epsilon. I have a bunch of things that I’d like to build on top of Epsilon, and I really want it to be a solid data platform. That means one of the key...
Azure DocumentDB overview
DocumentDB is a NoSQL database that supports fast performance, ease of development, and high availability. It is a schema-free NoSQL database. As data is stored in JSON format, reads and writes are served under...
Hadoop Weekly Issue #193
13 November 2016. Welcome to a double issue of Hadoop Weekly. There's lots of breadth in this week's issue, from Apache Avro to Apache Spark and everything in between. Technical: The Cloudera blog has a...
Writing IAM Policies CAREfully
This isn’t a tutorial, just a conceptual framework that’s helped me write better IAM policies. It’s extra useful when an app needs a group of services like DynamoDB, S3, and Kinesis. The method is...
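Daily Blog | Redis master-slave configuration notes and high-availability options
The Redis master-slave replication process: once a slave is configured, it connects to the master and sends the sync command. Whether this is a first connection or a reconnection, the master starts a background process that saves a database snapshot to a file, while the master's main process begins collecting and buffering new write commands. When the background process finishes writing the file, the master sends the file to the slave, and the slave...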
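The replication flow in this teaser can be sketched as a small in-memory simulation. This is not Redis code: `Master`, `Slave`, and `full_sync` are hypothetical names, the snapshot is a plain dict copy, and the background save is modelled synchronously. The teaser is cut off before it says what the slave does with the file; the sketch assumes it loads the snapshot and then applies the buffered writes.

```python
# Toy model of the full-sync flow; names and structure are illustrative assumptions.
class Master:
    def __init__(self):
        self.data = {}
        self.buffer = None  # holds writes collected while a snapshot is in progress

    def write(self, key, value):
        self.data[key] = value
        if self.buffer is not None:
            self.buffer.append((key, value))

    def begin_sync(self):
        # Stand-in for the background save: copy the dataset, start buffering writes.
        self.buffer = []
        return dict(self.data)

    def finish_sync(self):
        # Hand over the buffered writes and stop buffering.
        buffered, self.buffer = self.buffer, None
        return buffered


class Slave:
    def __init__(self):
        self.data = {}

    def load_snapshot(self, snapshot):
        self.data = dict(snapshot)

    def apply(self, commands):
        for key, value in commands:
            self.data[key] = value


def full_sync(master, slave, writes_during_snapshot=()):
    snapshot = master.begin_sync()
    # Writes that arrive while the snapshot is being produced and transferred.
    for key, value in writes_during_snapshot:
        master.write(key, value)
    slave.load_snapshot(snapshot)
    slave.apply(master.finish_sync())
```

Calling `full_sync(master, slave, [("b", 2)])` leaves the slave with the snapshot contents plus the write that arrived during the transfer, which is the point of buffering on the master side.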
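MongoDB 3.2.11 and 3.4.0 rc4 released
MongoDB 3.2.11 and 3.4.0 rc4 have been released. MongoDB is a product that sits between relational and non-relational databases; among non-relational databases it is the most feature-rich and the most like a relational database. The data structures it supports are very loose, similar to...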
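Essential reading on big data
Recommended by: Zhang Junlin. Editor's note: this reading list is the set of essential big data literature that Zhang Junlin compiled while writing 《大数据日知录》. The editor has read through most of the book. The author carefully organized the algorithms and architectures related to big data and judged the difficulty of each technical topic well, making it a fairly in-depth popular-science work for understanding the full picture of big data technology. This reading list, in turn, is an important reference for anyone who wants to go deeper into each technology. As the old Chinese saying goes, "The master leads you through the door; the practice is up to you." Reading the book only gets you through the door; reading the references in this post and putting them into practice is each engineer's own practice...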
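A proven, practical approach to building a flash-sale system
Why build a flash sale? That is not hard to explain: at the very least, it is common in internet e-commerce. So how do you design a reasonably complete flash-sale strategy? I think there are two key points. First, when we think of flash sales, our first reaction is that many people competing for one or a few products produces a very large instantaneous traffic peak, so supporting high concurrency is one of the priorities. Second, how to lock inventory: each item has a limited flash-sale stock, so how do we update the stock information? Do we operate directly on the MySQL database, or maintain the stock through Java's atomic classes?...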
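The inventory question can be illustrated with a minimal sketch of an atomic check-and-decrement. It is written in Python with a `threading.Lock` standing in for the Java atomic classes the teaser mentions; `FlashSaleStock` and `simulate` are hypothetical names, and this is not the article's design, which the teaser does not show.

```python
import threading


class FlashSaleStock:
    """In-memory stock counter; check and decrement happen under one lock."""

    def __init__(self, quantity):
        self._remaining = quantity
        self._lock = threading.Lock()

    def try_reserve(self):
        # Doing the check and the decrement together prevents overselling.
        with self._lock:
            if self._remaining > 0:
                self._remaining -= 1
                return True
            return False

    @property
    def remaining(self):
        with self._lock:
            return self._remaining


def simulate(stock, buyers):
    """Run `buyers` concurrent purchase attempts; return how many succeeded."""
    results = []
    results_lock = threading.Lock()

    def buy():
        ok = stock.try_reserve()
        with results_lock:
            results.append(ok)

    threads = [threading.Thread(target=buy) for _ in range(buyers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sum(results)
```

With 10 units and 50 concurrent buyers, exactly 10 reservations succeed. A single-process counter like this does not survive restarts or span multiple servers, which is one reason the database option is raised alongside it.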
Building a Modern Data Architecture with Enterprise Level Governance
This guest blog comes from Attivio, a certified Hortonworks Partner with the Governance Ready Badge, indicating that the Attivio Platform is recognized by Hortonworks as an application that integrates...
Dell EMC makes Hadoop less complex for customers
Dell EMC believes customers need to focus on delivering deeper insights and enhanced data-driven decision making. This is why Dell EMC provides expert guidance and know-how to streamline the...
Oracle RDBMS + Neo4j: Migrate or Sync a Subset of Data
By Gabe Stanek & Stefan Kolmar, Neo4j Field Engineering Team | November 21, 2016 There are several ways that Neo4j can work with Oracle RDBMS (or any other RDBMS). The approach you take depends...
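Redis in Action (5): Redis use cases
As data volumes grow, MySQL alone can no longer meet the needs of large internet applications, so Redis, as an in-memory database, serves well as an architectural complement. In some scenarios you can take full advantage of Redis's features to greatly improve efficiency. Use cases. Counters: statistics requirements are very common, and counts are maintained with atomic increments, for example likes, favorites, and shares. Leaderboards: leaderboards are sorted by score, for example top lists showing the most recent, the hottest, the most clicked, the most active, and so on...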
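The two use cases named here, counters and leaderboards, can be sketched with plain in-memory stand-ins. In Redis they would typically map to `INCR` and to sorted-set commands such as `ZINCRBY`; the classes below are hypothetical illustrations and do not talk to a Redis server. Tie-breaking by member name is an arbitrary choice made for this sketch.

```python
from collections import Counter


class Counters:
    """In-memory stand-in for per-key increment counters."""

    def __init__(self):
        self._counts = Counter()

    def incr(self, key, amount=1):
        self._counts[key] += amount
        return self._counts[key]


class Leaderboard:
    """In-memory stand-in for a score-ordered ranking."""

    def __init__(self):
        self._scores = {}

    def add_score(self, member, delta):
        self._scores[member] = self._scores.get(member, 0) + delta
        return self._scores[member]

    def top(self, n):
        # Highest score first; ties broken by member name (sketch-only choice).
        ranked = sorted(self._scores.items(), key=lambda kv: (-kv[1], kv[0]))
        return ranked[:n]
```

For example, two calls to `incr("likes:42")` return 1 and then 2, and `top(3)` returns up to three `(member, score)` pairs in descending score order.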
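Source code analysis of the first Redis release: startup process, part 1
A note up front: I am starting over by studying some classic open-source systems, using these classic and widely used systems to improve my technical skills and technical thinking. These are also systems we are likely to run into often at work. By learning how they are implemented and how they were designed, we can work with them more effectively, and, more importantly, we can learn the thinking behind them, which can help us improve our own system design skills...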