To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?協?硫?艤際??B 0011111110001011101001100011111110010111101100000011111111100100011111101000110111011011001111110011111101000010 3f8ba63f97b03fe47e8ddb3f3f42
EUC-JP ?協?硫?艤際??B 0011111110110110101010000011111111001110101100100011111111100111110111111011101011011101001111110011111101000010 3fb6a83fceb23fe7dfbadd3f3f42
UTF-8 뤿協렐硫쪕艤際솖샵B 11101011101001001011111111100101100011011001010011101011101000001001000011100111101000011010101111101100101010101001010111101000100010011010010011101001100110101001101111101100100001101001011011101100100000111011010101000010 eba4bfe58d94eba090e7a1abecaa95e889a4e99a9bec8696ec83b542
UHC 뤿協렐硫쪕艤際솖샵B 10001111111010111111101011110000101101111011110011010111101111001010010110001111111010111111101011110000101101111011110011010111101111001010010101000010 8febfaf0b7bcd7bca58febfaf0b7bcd7bca542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)