To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?閥?乘????櫛?? 0011111110010100101101000011111110011000101010010011111100111111001111110011111110001011111110010011111100111111 3f94b43f98a93f3f3f3f8bf93f3f
EUC-JP ?閥?乘????櫛?? 0011111111001000101101100011111111010000101010110011111100111111001111110011111110110110111110110011111100111111 3fc8b63fd0ab3f3f3f3fb6fb3f3f
UTF-8 뤶閥툘乘렦씐렞렭櫛렱섐 111010111010010010110110111010011001011010100101111011011000100010011000111001001011100110011000111010111010000010100110111011001001010010010000111010111010000010011110111010111010000010101101111001101010101110011011111010111010000010110001111011001000010010010000 eba4b6e996a5ed8898e4b998eba0a6ec9490eba09eeba0ade6ab9beba0b1ec8490
UHC 뤶閥툘乘렦씐렞렭櫛렱섐 10001111111001001101101111101100101110001000111111100011101010111000111010110101101111101011101110001110101011111000111010111010111100011110111010001110101111101011110010101011 8fe4dbecb88fe3ab8eb5bebb8eaf8ebaf1ee8ebebcab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)