DE Jobs

Search from over 2 Million Available Jobs, No Extra Steps, No Extra Forms, Just DirectEmployers

Job Information

3M Principal Data Engineer-R01128212 in MAPLEWOOD, Minnesota

Job Description:Principal Data EngineerCollaborate with Innovative 3Mers Around the WorldChoosing where to start and grow your career has a major impact on your professional and personal life, so its equally important you know that the company that you choose to work at, and its leaders, will support and guide you. With a diversity of people, global locations, technologies and products, 3M is a place where you can collaborate with othercurious, creative 3Mers.This position provides an opportunity to transition from other private, public, government or military environments to a 3M career.The Impact Youll Make in this RoleThe Principal Data Engineer will join the Corporate Research Systems Lab (CRSL) to develop scalable Data Systems. As part of an agile team, you will enable applications in diverse markets including energy, manufacturing, personal safety, transportation, electronics, and consumer. You will have the opportunity to design and support an Enterprise Data Mesh to empower informatics and digital technologies for users across the globe:Architect, design, and build scalable, efficient, and fault-tolerant data operations.Collaborate with senior leadership, analysts, engineers, and scientists to implement new mesh domain nodes and data initiatives.Drive technical architecture for accelerated solution designs, including data integration, modeling, governance, and applications.Explore and recommend new tools and technologies to optimize the data platform.Improve and implement data engineering and analytics engineering best practices.Collaborate with data engineering and domain nodes teams to design physical data models and mappings.Work with scientists and informaticians to develop advanced digital solutions and promote digital transformation and technologies.Perform code reviews, manage code performance improvements, and enforce code maintainability standards.Develop and maintain scalable data pipelines for ingesting, transforming, and distributing data streams.Advise and mentor 3M businesses, data scientists, and data consumers on data standards, pipeline development, and data consumption.Your Skills and ExpertiseTo set you up for success in this role from day one, 3M requires (at a minimum) the following qualifications:Bachelors degree or higher (completed and verified prior to start) from an accredited university.Twelve (12) years of professional experience in data warehouse/lakehouse design and development in a private, public, government or military environmentCompletely proficient in advanced SQL, Python/PySpark/Scala (any object-oriented language concepts), ML LibrariesMust have hands-on experience in Python to extract data from APIs, build data pipelines.Additional qualifications that could help you succeed even further in this role include:Exceptional background in data engineering, data systems, and data governance and having comfort working with structured and unstructured data and analyses. Exposure to data and data types in the Materials science, chemistry, computational chemistry, physics space a definite plus, but not required.Proficiency in developing or architecting modern distributed cloud architecture and workloads (AWS, Databricks preferred). Familiarity with data mesh style architecture design principles.Proficiency in building data pipelines to integrate business applications and procedures.Solid understanding preferred of advanced Databricks concepts like Delta Lake, MLFlow, Advanced Notebook Features, Custom Libraries and Workflows, Unity Catalog, etc.Experience with AWS cloud computing services and infrastructure developing data lakes and data pipelines leveraging multiple technologies such as AWS S3, AWS Glue, Elastic MapReduce, etc. and awareness of considerations for building scalable, distributed computational systems on Spark.Experience with stream-processing systems: Amazon Kinesis, Spark, Storm, Kafka, etc.Hands-on experience with relational SQL and NoSQL databases.Data quality and validat on principles experience, security principles data encryption, access control, authentication & authorization.Deep experience in definition and implementation of feature engineering.Experience with Docker containers and Kubernetes, experience developing or interacting with APIs.Experience in using data orchestration workflows using open-source tools like Temporal.io, Apache Airflow is a plus.Knowledge of data visualization tools like Dash Apps, Tableau, Power BI, etc.Good experience with agile development processes and concepts with leveraging project management tools like JIRA and Confluence.Devise and implement data engineering best practices across teams, optimize and redesign existing data engineering solutions to improve efficiency or stability, as well as monitoring of and consulting with domain node teams.Excellent interpersonal, collaborative, team building, and communication skills to ensure effective collaborations with matrixed teams.Travel: May include up to 10% domestic/internationalRelocation Assistance: May be authorizedLocation: Maplewood, MN or may consider remote U.S. work locationMust be legally authorized to work in country of employment without sponsorship for employment visa status (e.g., H1B status).Supporting Your Well-being3M offers many programs to help you live your best life both physically and financially. To ensure competitive pay and benefits, 3M regularly benchmarks with other companies that are comparable in size and scope.Chat with MaxFor assistance with searching through our current job openings or for more information about all things 3M, visit Max, our virtual recruiting assistant on 3M.com/careers.Applicable...Equal Opportunity Employer - minorities/females/veterans/individuals with disabilities/sexual orientation/gender identity

DirectEmployers