Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
246 views
in Technique[技术] by (71.8m points)

java - Is there a way to parallel insertion of records in hive tables using jdbc?

I want to use JDBC to insert 1 million records in Hive table.

  1. The Dataset is very huge. It has 2400 columns.
  2. At the moment I am using simple insert statements with 1000 records in values section.
  3. This is because I can not use addBatch() as it didnt work in hive driver. And also it creates a map reduce job for each insert statement which takes time to complete.
  4. I am not able to add more records in values as the query string length becomes problem.
  5. I tried to run multiple insert statements(having 1000 records in values of each) using threadpool in ExecutorService. But it didn't work as it seems like it still runs in sequence.
  6. Is there any other way to achieve parallelism in this case?
question from:https://stackoverflow.com/questions/66048934/is-there-a-way-to-parallel-insertion-of-records-in-hive-tables-using-jdbc

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)
Waitting for answers

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...