What bit string is a character (or a string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?紗???爾???衣 00111111100011101101000100111111001111110011111110001110101000100011111100111111001111111000100011011111 3f8ed13f3f3f8ea23f3f3f88df
EUC-JP 馹紗???爾???衣 100011111110100110100001101111001101001100111111001111110011111110111100101001000011111100111111001111111011000011100001 8fe9a1bcd33f3f3fbca43f3f3fb0e1
UTF-8 馹紗듐麟렦爾잭롛렯衣 111010011010011010111001111001111011010010010111111010111001001110010000111011111010011110110011111010111010000010100110111001111000100010111110111011001001111010101101111010111010000110011011111010111010000010101111111010001010000110100011 e9a6b9e7b497eb9390efa7b3eba0a6e788beec9eadeba19beba0afe8a1a3
UHC 馹紗듐麟렦爾잭롛렯衣 1110110011110001110111101110100110110101111000111110110011101000100011101011010111101100101100111100000011101000100011101101111110001110101111001110101111111101 ecf1dee9b5e3ece88eb5ecb3c0e88edf8ebcebfd

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a character set cannot represent are shown as "?" (0x3f).
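
A conversion like the one in the table can be sketched in Python as follows. This is only an illustration, not the code behind this page: the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, the helper names encode_row and convert are hypothetical, and Python's codecs may differ from this page's converter for some characters. Unencodable characters are replaced with "?" (0x3f), as in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("紗").items():
        print(label, bits, hex_string)
```

For example, encode_row("紗", "utf-8") gives the hexadecimal string e7b497, which matches the second character of the UTF-8 row above.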