To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???臆??阿??姚 00111111001111110011111110001001101100000011111100111111100010001010001000111111001111111001101101001100 3f3f3f89b03f3f88a23f3f9b4c
EUC-JP ???臆??阿??姚 00111111001111110011111110110010101100100011111100111111101100001010010000111111001111111101010110101101 3f3f3fb2b23f3fb0a43f3fd5ad
UTF-8 娛듽꺈臆롨뙃阿앭렢姚 111001011010100010011011111010111001001110111101111010101011101010001000111010001000011110000110111010111010000110101000111010111001100110000011111010011001100010111111111011001001010110101101111010111010000010100010111001011010011110011010 e5a89beb93bdeaba88e88786eba1a8eb9983e998bfec95adeba0a2e5a79a
UHC 娛듽꺈臆롨뙃阿앭렢姚 1110011111110100100010101110001110000011101011111110010111100110100011101110100010001100100010011110010010111001100111011110010110001110101100111110100011101110 e7f48ae383afe5e68ee88c89e4b99de58eb3e8ee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)