To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????SB 0011111100111111001111110011111100111111001111110101001101000010 3f3f3f3f3f3f5342
SJIS-WIN ?楔?僊雪姓SB 001111111001111010110110001111111001100101000001100100001110000110010000101010010101001101000010 3f9eb63f994190e190a95342
EUC-JP ?楔宬僊雪姓SB 0011111111011100101110001000111110111010110100111101000110100010110000001110001111000000101010110101001101000010 3fdcb88fbad3d1a2c0e3c0ab5342
UTF-8 說楔宬僊雪姓SB 1110100010101010101010101110011010100101100101001110010110101110101011001110010110000011100010101110100110011011101010101110010110100111100100110101001101000010 e8aaaae6a594e5aeace5838ae99baae5a7935342
UHC 說楔宬僊雪姓SB 1110000011100011111000001101101111100000111101001110000010111010111000001110010011100000111100110101001101000010 e0e3e0dbe0f4e0bae0e4e0f35342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)