What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 小?橈小泄堯小 10001111101011000011111110011110111101001000111110101100100111111001010111101010100111111000111110101100 8fac3f9ef48fac9f95ea9f8fac
EUC-JP 小?橈小泄堯小 10111110101011100011111111011100111101101011111010101110110111011111010111110100101000011011111010101110 beae3fdcf6beaeddf5f4a1beae
UTF-8 小숸橈小泄堯小 111001011011000010001111111011001000100010111000111001101010100110001000111001011011000010001111111001101011001110000100111001011010000010101111111001011011000010001111 e5b08fec88b8e6a988e5b08fe6b384e5a0afe5b08f
UHC 小숸橈小泄堯小 1110000110110011100110100100110111101000111110101110000110110011111000001101110011101000111010111110000110110011 e1b39a4de8fae1b3e0dce8ebe1b3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
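
The conversion in the table can be approximated with a short Python sketch. The page does not say how it performs the conversion, so everything here is an assumption: the helper names (to_bit_strings, convert) are hypothetical, and the Python codecs cp932, euc_jp and cp949 are used as stand-ins for SJIS-WIN, EUC-JP and UHC. Their mappings may differ from the page's for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the table.

```python
# Rough sketch, not the page's actual implementation.
# Keys are the charset labels from the table; values are the Python codec
# names assumed to be the closest equivalents.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly 8 binary digits, keeping leading zeros.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("小").items():
        print(name, binary, hexadecimal)
```

For example, to_bit_strings("小", "utf-8") gives the hexadecimal string e5b08f, the same three bytes that begin the UTF-8 row of the table.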