To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 û¹—nW}û¹—nW{^ 11111011101110011001011101101110010101110111110111111011101110011001011101101110010101110111101101011110 fbb9976e577dfbb9976e577b5e
SJIS-WIN ???nW}???nW{^ 00111111001111110011111101101110010101110111110100111111001111110011111101101110010101110111101101011110 3f3f3f6e577d3f3f3f6e577b5e
EUC-JP û??nW}û??nW{^ 1000111110101011111001010011111100111111011011100101011101111101100011111010101111100101001111110011111101101110010101110111101101011110 8fabe53f3f6e577d8fabe53f3f6e577b5e
UTF-8 û¹—nW}û¹—nW{^ 11000011101110111100001010111001110000101001011101101110010101110111110111000011101110111100001010111001110000101001011101101110010101110111101101011110 c3bbc2b9c2976e577dc3bbc2b9c2976e577b5e
UHC ?¹?nW}?¹?nW{^ 001111111010100111110110001111110110111001010111011111010011111110101001111101100011111101101110010101110111101101011110 3fa9f63f6e577d3fa9f63f6e577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)