๋ฐ˜์‘ํ˜•
kkh1902
Steadily
kkh1902
์ „์ฒด ๋ฐฉ๋ฌธ์ž
์˜ค๋Š˜
์–ด์ œ
  • ๋ถ„๋ฅ˜ ์ „์ฒด๋ณด๊ธฐ (178) N
    • DataEngineering (20) N
      • Spark (7) N
      • Airflow (2) N
      • DBT (2) N
      • Architecture (3) N
      • Data Quality (5) N
      • Infra (1) N
    • ๐Ÿค– AI (12) N
      • ML (7)
      • DL (0)
      • LLM (5) N
    • ๐Ÿ“š Study (74)
      • DataEngineering (0)
      • Spring (9)
      • Java (2)
      • Html, css (10)
      • JS, JQuery (29)
      • DB (5)
      • DevOps (13)
      • roadmap (2)
      • Architecture (1)
      • Flutter (2)
    • ๐Ÿ’ป Computer Science (28)
      • Datastructure (0)
      • Algorithm (2)
      • Design pattern (0)
      • Network (1)
      • DB (13)
      • Operating System (0)
      • Software Engineering (4)
      • CS interview (5)
      • topcit (3)
    • โš’๏ธ Etc (6)
      • Error (3)
      • Trouble_Shooting (2)
      • Dev_environment (1)
    • ๐Ÿ“ฐ News (24)
      • daily (7)
      • think (17)
    • ๐Ÿ“˜ Hobby (13)
      • English (13)

๋ธ”๋กœ๊ทธ ๋ฉ”๋‰ด

  • ๐Ÿ“‹ ์ด๋ ฅ์„œ
  • โšก๏ธ ๊นƒํ—ˆ๋ธŒ
  • ํƒœ๊ทธ
  • ๋ฐฉ๋ช…๋ก

๊ณต์ง€์‚ฌํ•ญ

์ธ๊ธฐ ๊ธ€

ํƒœ๊ทธ

  • React๋ฅผ ๋ฐฐ์›Œ์•ผํ•˜๋Š” ์ด์œ 
  • Qr_payment project # CSS ํ•ด์„ # Basic ๋งจ์œ„ ํ•ด์„
  • git stash
  • testcode
  • React JS # 2 The Basic of React
  • git
  • Flutter
  • React JS # ์ž์Šต์„œ # Component์™€ Props
  • SpringBootTest
  • Wonder # word
  • think #bootstrap์„ ์จ์•ผํ•˜๋Š” ์ด์œ 
  • sourcetreee
  • db
  • ์†Œํ”„ํŠธ์›จ์–ด ๊ณตํ•™ # chapter1
  • ์†Œํ”„ํŠธ์›จ์–ด ๊ณตํ•™ #project๋งŒ๋“ค๋•Œ ์ค‘์š”
  • junit5
  • React JS #์ž์Šต์„œ
  • React # JSX
  • gitaction
  • Linear Regression

์ตœ๊ทผ ๋Œ“๊ธ€

์ตœ๊ทผ ๊ธ€

ํ‹ฐ์Šคํ† ๋ฆฌ

250x250
hELLO ยท Designed By ์ •์ƒ์šฐ.
๊ธ€์“ฐ๊ธฐ / ๊ด€๋ฆฌ์ž
kkh1902

Steadily

DataEngineering/Spark

Lazy Evaluation ์ด๋ž€?

2026. 1. 30. 19:36
728x90
๋ฐ˜์‘ํ˜•

ํ•œ ์ค„ ๋น„์œ 

๐Ÿ‘‰ “ํ•  ์ผ ๋ชฉ๋ก๋งŒ ์ ์–ด๋‘๊ณ ,์ง„์งœ ์ผ์€ ๋งˆ์ง€๋ง‰์— ํ•œ ๋ฒˆ์— ํ•œ๋‹ค.”



Lazy Evaluation ์ด๋ž€?

๐Ÿ‘‰ ์—ฐ์‚ฐ์„ ์ฆ‰์‹œ ์‹คํ–‰ํ•˜์ง€ ์•Š๊ณ ,‘์‹คํ–‰ ๊ณ„ํš(DAG)’๋งŒ ์Œ“์•„๋‘๋Š” ๋ฐฉ์‹

Spark์—์„œ๋Š”:

  • filter
  • select
  • join

๊ฐ™์€ ๊ฑด ๋ฐ”๋กœ ์‹คํ–‰ ์•ˆ ํ•จ


์–ธ์ œ ์ง„์งœ ์‹คํ–‰๋˜๋‚˜?

๐Ÿ‘‰ Action์„ ๋งŒ๋‚ฌ์„ ๋•Œ

Action ์˜ˆ์‹œ

  • count()
  • show()
  • collect()
  • write()

์ด ์ˆœ๊ฐ„์— Spark๊ฐ€:

  1. ์ง€๊ธˆ๊นŒ์ง€ ์Œ“์ธ ์—ฐ์‚ฐ์„ ๋ณด๊ณ 
  2. ์‹คํ–‰ ๊ณ„ํš์„ ์ตœ์ ํ™”ํ•˜๊ณ 
  3. ํ•œ ๋ฒˆ์— ์‹คํ–‰

์˜ˆ์ œ๋กœ ๋ณด๋ฉด ๋ฐ”๋กœ ์ดํ•ด๋จ

df2 = (
    df
    .filter(df.price > 100)
    .select("user_id", "price")
)

๐Ÿ‘‰ ์—ฌ๊ธฐ๊นŒ์ง€๋Š” ์•„๋ฌด ์ผ๋„ ์•ˆ ํ•จ

df2.count()

๐Ÿ‘‰ ์ด๋•Œ์„œ์•ผ ์ง„์งœ ์‹คํ–‰


์™œ ์ด๋ ‡๊ฒŒ ํ•˜๋ƒ?

1๏ธโƒฃ ์ตœ์ ํ™” ๊ฐ€๋Šฅ

  • ๋ถˆํ•„์š”ํ•œ ์—ฐ์‚ฐ ์ œ๊ฑฐ
  • ์กฐ๊ฑด ๋จผ์ € ์ ์šฉ
  • ์ปฌ๋Ÿผ ์ตœ์†Œ ์ฝ๊ธฐ

๐Ÿ‘‰ ์ž๋™์œผ๋กœ ๋น ๋ฅด๊ฒŒ


2๏ธโƒฃ ์‹คํŒจ ์ค„์ด๊ธฐ

  • ์ค‘๊ฐ„ ๊ฒฐ๊ณผ ์ €์žฅ ์•ˆ ํ•จ
  • ํ•„์š”ํ•  ๋•Œ๋งŒ ๊ณ„์‚ฐ

3๏ธโƒฃ ๋ถ„์‚ฐ ์ฒ˜๋ฆฌ์— ์œ ๋ฆฌ

  • ์ „์ฒด DAG ๋ณด๊ณ 
  • ํŒŒํ‹ฐ์…˜/์…”ํ”Œ ์ „๋žต ์ตœ์ ํ™”

Lazy vs Eager (์ฆ‰์‹œ ์‹คํ–‰)

๊ตฌ๋ถ„ Lazy Eager

์‹คํ–‰ ์‹œ์  action ์‹œ ์ฆ‰์‹œ
์ตœ์ ํ™” ๊ฐ€๋Šฅ ๊ฑฐ์˜ ์—†์Œ
Spark โœ… โŒ
Pandas โŒ โœ…

Lazy Evaluation์˜ ๋ถ€์ž‘์šฉ

๐Ÿ‘‰ ์ฒ˜์Œ ์“ฐ๋ฉด ํ—ท๊ฐˆ๋ฆผ

  • print ์ฐ์—ˆ๋Š”๋ฐ ์•ˆ ๋‚˜์˜ด
  • cache ํ–ˆ๋Š”๋ฐ ํšจ๊ณผ ์—†๋Š” ๊ฒƒ ๊ฐ™์Œ

โ— cache๋„ action ์žˆ์–ด์•ผ ์‹ค์ œ๋กœ ์ €์žฅ๋จ


๋ฉด์ ‘์šฉ 20์ดˆ ๋‹ต๋ณ€

“Lazy evaluation์€ Spark๊ฐ€ ๋ณ€ํ™˜ ์—ฐ์‚ฐ์„ ์ฆ‰์‹œ ์‹คํ–‰ํ•˜์ง€ ์•Š๊ณ 

์‹คํ–‰ ๊ณ„ํš์œผ๋กœ๋งŒ ์Œ“์•„๋‘์—ˆ๋‹ค๊ฐ€,

action์ด ํ˜ธ์ถœ๋  ๋•Œ ์ „์ฒด DAG๋ฅผ ์ตœ์ ํ™”ํ•ด ํ•œ ๋ฒˆ์— ์‹คํ–‰ํ•˜๋Š” ๋ฐฉ์‹์ž…๋‹ˆ๋‹ค.”


๋งˆ์ง€๋ง‰ ํ•ต์‹ฌ ๋ฌธ์žฅ

๐Ÿ‘‰ Lazy evaluation ๋•๋ถ„์— Spark๋Š” ๋น ๋ฅด๋‹ค.


728x90
๋ฐ˜์‘ํ˜•

'DataEngineering > Spark' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

Spark ์ตœ์ ํ™” ์ฒดํฌ๋ฆฌ์ŠคํŠธ  (0) 2026.01.30
Suffle์ด๋ž€?  (1) 2026.01.30
df.cache() ๋ฅผ ์–ธ์ œ ์จ์•ผํ•˜๋‚˜?  (0) 2026.01.30
Dataframe ์ด๋ž€?  (0) 2026.01.30
Spark ๋ž€?  (0) 2026.01.30
    'DataEngineering/Spark' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€
    • Suffle์ด๋ž€?
    • df.cache() ๋ฅผ ์–ธ์ œ ์จ์•ผํ•˜๋‚˜?
    • Dataframe ์ด๋ž€?
    • Spark ๋ž€?
    kkh1902
    kkh1902
    1Day 1 Commit ๋ชฉํ‘œ ๊ณต๋ถ€ํ•œ๊ฒƒ๋“ค ๋งค์ผ ๊ธฐ๋กํ•˜๊ธฐ

    ํ‹ฐ์Šคํ† ๋ฆฌํˆด๋ฐ”