To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 語???????? 10001100111010100011111100111111001111110011111100111111001111110011111100111111 8cea3f3f3f3f3f3f3f3f
EUC-JP 語???????? 10111000111011000011111100111111001111110011111100111111001111110011111100111111 b8ec3f3f3f3f3f3f3f3f
UTF-8 語ⓨ낡溜띹콊溜싦펿 111010001010101010011110111000101001001110101000111010111000001010100001111011111010011110001011111010111001110110111001111011001011110110001010111011111010011110001011111011001000101110100110111011011000111010111111 e8aa9ee293a8eb82a1efa78beb9db9ecbd8aefa78bec8ba6ed8ebf
UHC 語ⓨ낡溜띹콊溜싦펿 111001011101111010101000111001011011001110110000111010101111111010001101111010001011000110000110111010101111111010011010111001001011110010001110 e5dea8e5b3b0eafe8de8b186eafe9ae4bc8e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)