To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????Ё円??B 001111110011111100111111001111110011111110000100010001101000100101111110001111110011111101000010 3f3f3f3f3f8446897e3f3f42
EUC-JP ?????Ё円??B 001111110011111100111111001111110011111110100111101001111011000111011111001111110011111101000010 3f3f3f3f3fa7a7b1df3f3f42
UTF-8 遼㎪솄若띺Ё円됧츒B 111011111010011110000011111000111000111010101010111011001000011010000100111011111010010110110100111010111001110110111010110100001000000111100101100001101000011011101011100100001010011111101100101110001001001001000010 efa783e38eaaec8684efa5b4eb9dbad081e58686eb90a7ecb89242
UHC 遼㎪솄若띺Ё円됧츒B 11101001101011001010011111100110100110011000100111100101101011101000110111101001101011001010011111100101111101111000100111100101101011101000110101000010 e9aca7e69989e5ae8de9aca7e5f789e5ae8d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)