To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????T 001111110011111100111111001111110011111100111111001111110011111101010100 3f3f3f3f3f3f3f3f54
SJIS-WIN 栓賂??趙???T 100100001111000010011000010001110011111100111111111001101110001000111111001111110011111101010100 90f098473f3fe6e23f3f3f54
EUC-JP 栓賂??趙???T 110000001111001011001111101010000011111100111111111011001110010000111111001111110011111101010100 c0f2cfa83f3fece43f3f3f54
UTF-8 栓賂렰렡趙뀜렰렭T 11100110101000001001001111101000101100111000001011101011101000001011000011101011101000001010000111101000101101101001100111101011100000001001110011101011101000001011000011101011101000001010110101010100 e6a093e8b382eba0b0eba0a1e8b699eb809ceba0b0eba0ad54
UHC 栓賂렰렡趙뀜렰렭T 1110111011111011110101101111000110001110101111011000111010110010111100001110000110110010111100011000111010111101100011101011101001010100 eefbd6f18ebd8eb2f0e1b2f18ebd8eba54

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)