What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???倭??娃???с 0011111100111111001111111001100001100000001111110011111110001000101000010011111100111111001111111000010010000011 3f3f3f98603f3f88a13f3f3f8483
EUC-JP ???倭??娃???с 0011111100111111001111111100111111000001001111110011111110110000101000110011111100111111001111111010011111100011 3f3f3fcfc13f3fb0a33f3f3fa7e3
UTF-8 了쀯쉠倭뗰숱娃묕슈料с 1110111110100110101110101110110010000000101011111110110010001001101000001110010110000000101011011110101110010111101100001110110010001000101100011110010110101000100000111110101110101100100101011110110010001010100010001110111110100110101111101101000110000001 efa6baec80afec89a0e580adeb97b0ec88b1e5a883ebac95ec8a88efa6bed181
UHC 了쀯쉠倭뗰숱娃묕슈料с 11101000111001111001011111101111101111011010101011101000110111101000101111101111101111011010001011101000110111111001000111101111101111011011010011101000111101111010110011100011 e8e797efbdaae8de8befbda2e8df91efbdb4e8f7ace3

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
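
The conversion in the table can be approximated with a short Python sketch. This is not the page's own implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the use of errors="replace", which substitutes "?" (0x3f) for characters the target charset cannot represent, in line with the 3f bytes in the table. The names CHARSETS, encode_row, and encode_table are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Unencodable characters become b"?" (0x3f) instead of raising an error.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, character, binary, hexadecimal.
    for label, (binary, hexadecimal) in encode_table("倭").items():
        print(label, "倭", binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.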