To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??æ?????æ?? 0011111100111111111001100011111100111111001111110011111100111111111001100011111100111111 3f3fe63f3f3f3f3fe63f3f
SJIS-WIN ????泥?????? 001111110011111100111111001111111001001101000100001111110011111100111111001111110011111100111111 3f3f3f3f93443f3f3f3f3f3f
EUC-JP ??æ?泥???æ?? 00111111001111111000111110101001110000010011111111000101101001010011111100111111001111111000111110101001110000010011111100111111 3f3f8fa9c13fc5a53f3f3f8fa9c13f3f
UTF-8 룶죴æ룶泥팬룶죴æ룶濫 11101011101000111011011011101100101000111011010011000011101001101110101110100011101101101110011010110011101001011110110110001100101011001110101110100011101101101110110010100011101101001100001110100110111010111010001110110110111011111010010010100010 eba3b6eca3b4c3a6eba3b6e6b3a5ed8caceba3b6eca3b4c3a6eba3b6efa4a2
UHC 룶죴æ룶泥팬룶죴æ룶濫 10001111101010111010000110001111101010011010000110001111101010111101001011111010110001101101001010001111101010111010000110001111101010011010000110001111101010111101000111111010 8faba18fa9a18fabd2fac6d28faba18fa9a18fabd1fa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)