 Kishore
    Kishore 
    
    
        
        
        
        Hii
    
 
     Kishore
    Kishore 
    
    
        
        
        
        😌
    
 
     Kishore
    Kishore 
    
    
        
        
        
        How to convert string to time stamp in dataframe
        My column value is 
        2013-07-25 00:00:00.0
    
 
     Anonymous
    Anonymous 
    
    
        
        
        
        https://www.oreilly.com/ideas/handling-real-time-data-operations-in-the-enterprise
    
 
     Ranganath
    Ranganath 
    
    
        
        
        
        Hi ,
        I have a table in hive whose delimiter is '^' and data file also loaded with '^' delimiter but now we got a requirement to change the delimiter to '^A' ....I changed the delimiter in table but the data now is being displayed as NULL as data in file having different delimiter ...how can I change delimiter at file level also ...
         Or do I need to create staging table for all table but this will be a problem as I have 20 tables ...
    
 
     Anonymous
    Anonymous 
    
    
        
        
        
        Could help me starting in data engineering, 
        
        best plan?
    
 
     KrivdaTheTriewe
    KrivdaTheTriewe 
    
    
 
     Pedro
    Pedro 
    
    
        
        
        
        Hey people !
    
 
     Alexander
    Alexander 
    
    
        
        
        
        Hi!
    
 
     Pedro
    Pedro 
    
    
        
        
        
        How is going ?
    
 
     Pedro
    Pedro 
    
    
        
                    
                        
                            
                            not much, whats up u?
                        
                    
                
        
        
        cool
        i'm fine
        trying to solve a problem with fixed-width row to print in files to send to a legacy system to consume and then back to HDFS
    
 
 
     Gurmohit Singh
    Gurmohit Singh 
    
    
        
        
        
        Hello :)
    
 
     Bevilaqua
    Bevilaqua 
    
    
        
        
        
        Some here who knows Google Dataprep tool?
    
 
     Pedro
    Pedro 
    
    
 
     Anonymous
    Anonymous 
    
    
        
        
        
        Dear experts, I am new on grafana to make a monitoring dashboard. 
        
        Ive connect my data source with CloudWatch. And monitoring an AWS/EC2 instance
        
        But only 14 metrics were available on grafana. 
        
        I need most of all metrics of ec2 instance. 
        
        Need help thanks
    
 
     Pedro
    Pedro 
    
    
 
     Anonymous
    Anonymous 
    
    
        
        
        
        Hello everyone... I have testing 3.8 years of experience.. Im very much intrested to work as data engineer .. Can someone help me with the skills required to restart my career as data engineer...
        Please let me know If there are any training institutions which can help to upskill my self in Hyderabad... Help me with ur ideas and suggestions
    
 
     Saipul
    Saipul 
    
    
 
     Saipul
    Saipul 
    
    
        
        
        
        maybe you can take course on dataquest
    
 
     Saipul
    Saipul 
    
    
        
        
        
        choose data engineer track
    
 
     Saipul
    Saipul 
    
    
        
        
        
        online course
    
 
     Pedro
    Pedro 
    
    
        
        
        
        Just study what's does a data engineer. There is some great blog posts about that over the internet
    
 
     Pedro
    Pedro 
    
    
        
        
        
        Medium is a good starting point
    
 
     Sam
    Sam 
    
    
        
        
        
        Hello,
        
        Could anybody are familiar with airflow?
        
        I got stuck with hive jdbc connection.
        
        Could you please take a look at the problem through this link?
        
        https://stackoverflow.com/questions/55134669/cannot-modify-mapred-job-name-at-runtime-it-is-not-in-list-of-params-that-are-a
    
 
     Pedro
    Pedro 
    
    
        
        
        
        Airflow and jdbc ?
    
 
     Pedro
    Pedro 
    
    
        
        
        
        Ooh
    
 
     Pedro
    Pedro 
    
    
        
        
        
        I see
    
 
     Pedro
    Pedro 
    
    
        
        
        
        Hive and jdbc
    
 
     Pedro
    Pedro 
    
    
        
        
        
        Sorry never worked with those
    
 
     Sam
    Sam 
    
    
        
        
        
        yeap, hive jdbc.
    
 
     Ashhadul
    Ashhadul 
    
    
        
        
        
        Hi
        
        Anyone from Hyderabad, India
        
        Interested to learn data science
    
 
     Anonymous
    Anonymous 
    
    
        
        
        
        Yes
    
 
     Anonymous
    Anonymous 
    
    
        
        
        
        I m interested
    
 
     KrivdaTheTriewe
    KrivdaTheTriewe 
    
    
        
        
        
        https://www.youtube.com/watch?v=K6oZuB8_dU8&feature=youtu.be
    
 
     Anonymous
    Anonymous 
    
    
 
     Saipul
    Saipul 
    
    
 
     Oleksandr
    Oleksandr 
    
    
        
        
        
        No Java, Scala and JS? I don’t think so.
    
 
     Oleksandr
    Oleksandr 
    
    
        
        
        
        Also nowadays majority goes. Python -> AWS/Azure. Why bother learning stuff in between.
    
 
     Pedro
    Pedro 
    
    
        
        
        
        The best language for the job is the right thing. The best languages for some cases are the one which has middle set of features and some sort of simplicity, mainly those for systems programming (Go, Rust, Elixir are good ones)
    
 
     Pedro
    Pedro 
    
    
        
        
        
        This mainstream learning path is all wrong and really passed away out of reality
    
 
     Pedro
    Pedro 
    
    
        
        
        
        A good engineer, concerning big data, will employ the language which best fits the problem and not what some dinosaurs is telling as single truth
    
 
     A
    A 
    
    
        
        
        
        Visual Studio Code Lint displays errors about imports that dont seem to harm the scripts I run usin same VS Code.
        
        Is it a known issue?
    
 
     Ashhadul
    Ashhadul 
    
    
        
        
        
        “Generic Review Scraper for any Android App” by Ashhadul Islam https://link.medium.com/vjLOgtZzvW
    
 
     B
    B 
    
    
        
        
        
        what do you guys think about serverless data lake?
    
 
     B
    B 
    
    
        
        
        
        mainly on AWS
    
 
     Grigory
    Grigory 
    
    
        
        
        
        @voreh hey; what do you mean by that? what would be a serverless component?
    
 
     Grigory
    Grigory 
    
    
        
        
        
        where is the data
    
 
     Grigory
    Grigory 
    
    
        
        
        
        or you talk about that AWS blogpost?
    
 
     Grigory
    Grigory 
    
    
        
        
        
        that’s fine; if you have enough will to create your own serverless API around any data storage - why not to do it; sounds okay
    
 
     B
    B 
    
    
        
        
        
        @pomadchin well, install all softwares and keep them updated, calculate the amount of RAM, SSD, sound too much for me when you can use a serverless data lake for your applications
    
 
     Grigory
    Grigory 
    
    
        
        
        
        its only for the API part; AWS proposes to store data in their s3 storage
    
 
     Grigory
    Grigory 
    
    
        
        
        
        not sure how is it 'serverless'
    
 
     Grigory
    Grigory 
    
    
        
        
        
        they just used this bazzword IMO
    
 
     Grigory
    Grigory 
    
    
        
        
        
        they also have smth pre done https://aws.amazon.com/ru/solutions/data-lake-solution/
    
 
     B
    B 
    
    
        
        
        
        well, i mean all the resources together to create a serverless infrasctrucutre, like the link you send it, (s3, lambda, kinesis, athena) without create an EC2 instance
    
 
     Grigory
    Grigory 
    
    
        
        
        
        well I guess by serverless they meant (and you mean) basically to lock on some vendor (aws in our case) software (SAAS)
    
 
     Alexander
    Alexander 
    
    
        
                    
                        
                            
                            what do you guys think about serverless data lake?
                        
                    
                
        
        
        Well under serverless you mean services without launching certain EC2 in your account (only API calls). You can start with S3 (storage), Lambda, Glue (ETL and DC), Athena (SQL engine), Kinesis, ALB, etc. building a data platform (we successfully leveraged them on one of the projects) covering data ingestion, transformation, discovery and DG, etc. but probably you'll come to using EMR, EKS, etc. also for further steps.
    
 
 
     Grigory
    Grigory 
    
    
        
        
        
        ++
    
 
     Ashhadul
    Ashhadul 
    
    
        
        
        
        https://mysentimeter.herokuapp.com/senti/
        
        I have built a simple generic django application that enables any end-user to perform text classification.
        
        User guide: https://medium.com/@ashhadulislam/generic-sentiment-analysis-on-cloud-5456131ba461
        
        Do use the same and let me know
    
 
     Sparsh
    Sparsh 
    
    
        
        
        
        https://link.medium.com/udTjR9CklX
    
 
     K.K
    K.K 
    
    
        
        
        
        Plz anyone can suggest me regarding the preparation of cloudera certification for data engineering..CCP DE 575.
    
 
     K.K
    K.K 
    
    
        
        
        
        Any body have some idea then plz ! Honourable members guide me..😔
    
 
     Saipul
    Saipul 
    
    
        
        
        
        Hello guys, are there use MapR in your company ?
    
 
     Ashhadul
    Ashhadul 
    
    
        
        
        
        Guys if you want to learn how to build a chatbot from scratch, I take one hour session in which I teach you to build a chatbot.
        It's not free, it's 500 rupees per person, per session.
        If you are interested to learn, please ping me.
        
        Pre requisite: hands: experience with python coding
    
 
     Rakz
    Rakz 
    
    
        
        
        
        Hi all, is there any implementation of scala operator in Apache airflow for scheduling. Is there any scheduler available for scala?, not looking for  a cron though
    
 
     Grigory
    Grigory 
    
    
 
     Rakz
    Rakz 
    
    
        
        
        
        I’m looking for an option to run scala code via Airflow or any suggestions on how to schedule a scala code without using crontab
    
 
     Grigory
    Grigory 
    
    
 
     Grigory
    Grigory 
    
    
        
        
        
        There is nothing special in a Scala code jar
    
 
     Rakz
    Rakz 
    
    
        
        
        
        Got it, this is because Airflow uses python. I wanted to check if any code samples available for scala based scheduler
    
 
     Grigory
    Grigory 
    
    
        
        
        
        You mean to write dags in Scala?
    
 
     Rakz
    Rakz 
    
    
 
     Grigory
    Grigory 
    
    
        
        
        
        only python