📐

Manhattan Distance (KNN)

Step-by-step Manhattan distance calculations

X Value

Y Value

grid-like data বা অনেকগুলো dimension থাকলে Manhattan distance ব্যবহার করা হয়।

d = |x₁ − x₂| + |y₁ − y₂|

💡 সহজ ব্যাখ্যা:

(x₁, y₁) হলো আমাদের Target Data বা নতুন Point।
(x₂, y₂) হলো আগে থেকে দেয়া Sample Data।
আমরা প্রতিটি Sample এর সাথে নতুন Point টির x এবং y এর পার্থক্য (বিয়োগফল) বের করে তার পরম মান (absolute value) যোগ করে Distance বের করি।

এই formula দিয়ে নতুন point থেকে সব sample এর distance বের করবো।

নতুন point = (3, 7)

💡 ধাপ: প্রথমে X এর পার্থক্য (|7 − 3| = 4) এবং Y এর পার্থক্য (|7 − 7| = 0) বের করে যোগ করা হলো।

Distance

= |7 − 3| + |7 − 7|

= |4| + |0|

= 4 + 0

= 4

💡 ধাপ: প্রথমে X এর পার্থক্য (|7 − 3| = 4) এবং Y এর পার্থক্য (|4 − 7| = 3) বের করে যোগ করা হলো।

Distance

= |7 − 3| + |4 − 7|

= |4| + |-3|

= 4 + 3

= 7

পরীক্ষার খাতায় ঠিক এই ফরম্যাটে টেবিল করে উত্তর লিখলে ভালো নম্বর পাওয়া যাবে:

Name	X Value	Y Value	Class	Distance
Sample 1	7	7	Bad	\|7−3\| + \|7−7\| = 4
Sample 2	7	4	Bad	\|7−3\| + \|4−7\| = 7

যেহেতু K = 3, তাই আমরা সবচেয়ে কাছের (কম distance) 3 টি neighbor বা sample নিবো:

✅ চূড়ান্ত সিদ্ধান্ত (Final Prediction):

উপরের এই 3 টি Nearest Neighbors এর মধ্যে যে Label টি সবচেয়ে বেশিবার আছে (Majority Vote), আমাদের নতুন Point টির Label ও সেটিই হবে!

Use Euclidean distance if your data is continuous and evenly scaled.
Use Manhattan distance if your data has many dimensions or is grid-like.
For most business or simple classification problems, Euclidean distance is more appropriate and widely used.