Skip to content

Commit f09a8d8

Browse files
find top-N and bottom-N
1 parent af44b7a commit f09a8d8

File tree

1 file changed

+24
-0
lines changed

1 file changed

+24
-0
lines changed

top-N/top-N.txt

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
# ./pyspark
2+
Python 2.6.9 (unknown, Sep 9 2014, 15:05:12)
3+
Welcome to
4+
____ __
5+
/ __/__ ___ _____/ /__
6+
_\ \/ _ \/ _ `/ __/ '_/
7+
/__ / .__/\_,_/_/ /_/\_\ version 1.2.0
8+
/_/
9+
10+
Using Python version 2.6.9 (unknown, Sep 9 2014 15:05:12)
11+
SparkContext available as sc.
12+
>>>
13+
>>> nums = [10, 1, 2, 9, 3, 4, 5, 6, 7]
14+
>>> sc.parallelize(nums).takeOrdered(3)
15+
[1, 2, 3]
16+
>>> sc.parallelize(nums).takeOrdered(3, key=lambda x: -x)
17+
[10, 9, 7]
18+
>>>
19+
>>> kv = >>> [(10,"z1"), (1,"z2"), (2,"z3"), (9,"z4"), (3,"z5"), (4,"z6"), (5,"z7"), (6,"z8"), (7,"z9")]
20+
>>> sc.parallelize(kv).takeOrdered(3)
21+
[(1, 'z2'), (2, 'z3'), (3, 'z5')]
22+
>>>
23+
>>> sc.parallelize(kv).takeOrdered(3, key=lambda x: -x[0])
24+
[(10, 'z1'), (9, 'z4'), (7, 'z9')]

0 commit comments

Comments
 (0)