Job Description
Responsibilities:
We are recruiting a motivated and passionate Data Engineer to develop, optimize and operate our data pipelines and applications, which help implement innovative data platform processes and deliver actionable insights. In this role you will establish scalable, efficient, automated processes for large-scale data pipelines and build insightful applications on top of an open-ended pool of data.
Job Requirements:
* Build the infrastructure to gather and process large volumes of raw data into data suitable for analysis
* Design and implement near real-time data applications on top of huge amounts of data
* Enhance data products such as query, monitoring and analysis tools to make work in the big data environment easier for everyday users
* Support operational monitoring requests from business teams
* Track risk operation issues and work with partners on investigation
Basic Qualifications:
* Bachelor’s degree or above with at least 3 years of working experience in a relevant field
* Strong programming skills in Java, Scala or Python; hands-on experience with a scripting language such as Bash or Perl
* Experience in custom data pipeline design, implementation and maintenance
* Experience in big data programming (Hadoop/Spark/Kafka/Storm)
* Experience in database usage (MySQL/MongoDB/Redis)
* Experience in web development and machine learning applications is a big plus
* Knowledge of the AWS data stack: S3, EMR, Data Pipeline, Lambda, Kinesis
* Good verbal and written communication skills in English
Company Introduction
PatSnap is a disruptive, market-leading SaaS provider of intellectual property
analytics for analysing technology trends, accelerating innovation, market
planning, competitor intelligence and maximising returns on existing and new
IP assets. It is used by over 3000 organisations globally, including NASA, GE,
Lego, Vodafone, Ferrari, Siemens, Xiaomi and China Mobile. The company is
backed by world-class venture capital firms such as Sequoia, Summit
Partners, Shunwei and Vertex Ventures. With an impressive revenue growth
rate of 1078% from 2014 to 2016, PatSnap was ranked 44th on the “Deloitte
Technology Fast 500” for Asia Pacific.