Genpact
Bengaluru, Karnataka
Responsibilities
• Should be aware of the end-to-end SDLC process in the Python ecosystem
• Maintain and advocate a high bar for development best practices: test coverage, design patterns, documentation, and self-documenting code
• Good exposure to RDBMS, SQL, and relevant tools
• Develop ETL pipelines to move data from source systems (CRM, weblogs, etc.) into a data lake (HDFS, S3, etc.)
• Deep experience in developing data processing tasks using PySpark, such as reading data from external sources, merging data, performing data enrichment, and loading into target data destinations
• Knowledge of ORM libraries