Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????U????? | 00111111001111110011111100111111010101010011111100111111001111110011111100111111 | 3f3f3f3f553f3f3f3f3f |
| SJIS-WIN | 趙???U殿???有 | 11100110111000100011111100111111001111110101010110010011011000010011111100111111001111111001011101001100 | e6e23f3f3f5593613f3f3f974c |
| EUC-JP | 趙???U殿???有 | 11101100111001000011111100111111001111110101010111000101110000100011111100111111001111111100110110101101 | ece43f3f3f55c5c23f3f3fcdad |
| UTF-8 | 趙얹렰렚U殿닸렲렧有 | 11101000101101101001100111101100100101101011100111101011101000001011000011101011101000001001101001010101111001101010111010111111111010111000101110111000111010111010000010110010111010111010000010100111111001101001110010001001 | e8b699ec96b9eba0b0eba09a55e6aebfeb8bb8eba0b2eba0a7e69c89 |
| UHC | 趙얹렰렚U殿닸렲렧有 | 11110000111000011011111011110001100011101011110110001110101011010101010111101110111111001011010011100110100011101011111110001110101101101110101011110011 | f0e1bef18ebd8ead55eefcb4e68ebf8eb6eaf3 |

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
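
A conversion like this can be roughly reproduced offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and `encode_row` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f).

```python
# Assumed mapping from this page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hexa = encode_row("\u6709U", codec)  # "\u6709" is 有
        print(label, bits, hexa)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.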