You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Here, in each batch interval, the RDD generated by stream1 will be joined with the RDD generated by stream2. You can also do leftOuterJoin, rightOuterJoin, fullOuterJoin. Furthermore, it is often very useful to do joins over windows of the streams. That is pretty easy as well.
看到sparkStreaming官网上介绍的join
Here, in each batch interval, the RDD generated by stream1 will be joined with the RDD generated by stream2. You can also do leftOuterJoin, rightOuterJoin, fullOuterJoin. Furthermore, it is often very useful to do joins over windows of the streams. That is pretty easy as well.
具体的实现细节是说这个join只是的那个批次内的多个stream的join,暂时还无法做到跨批次的?
如果sparkstream暂时不能做到跨批次的join,那么若是我们自己做的话,一般的思路是怎样的?
The text was updated successfully, but these errors were encountered: