What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 擬??櫻????擬?? 1000101101011011001111110011111110011111010011100011111100111111001111110011111110001011010110110011111100111111 8b5b3f3f9f4e3f3f3f3f8b5b3f3f
EUC-JP 擬??櫻????擬?? 1011010110111100001111110011111111011101101011110011111100111111001111110011111110110101101111000011111100111111 b5bc3f3fddaf3f3f3f3fb5bc3f3f
UTF-8 擬묐죧櫻얜죾溜볾擬묐죧 111001101001001110101100111010111010110010010000111011001010001110100111111001101010101110111011111011001001011010011100111011001010001110111110111011111010011110001011111010111011001110111110111001101001001110101100111010111010110010010000111011001010001110100111 e693acebac90eca3a7e6abbbec969ceca3beefa78bebb3bee693acebac90eca3a7
UHC 擬묐죧櫻얜죾溜볾擬묐죧 11101011111101001001000111101011101000011000001011100101101000011011111011101011101000011001011011101010111111101001010001000001111010111111010010010001111010111010000110000010 ebf491eba182e5a1beeba196eafe9441ebf491eba182

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
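
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page, which is not shown here. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are Python's own and are assumed to correspond to the charsets listed above, and the helper names `encode_row` and `encoding_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f). The number of `?` bytes emitted per character may differ from the output of this page.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encoding_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encoding_table("擬").items():
        print(label, bits, hexstr)
```

Running the script prints one line per charset, in the same column order as the table above, with the character column omitted.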