To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ??杷???◇?? 0011111100111111100101000110011000111111001111110011111110000001100111100011111100111111 3f3f94663f3f3f819e3f3f
EUC-JP ??杷???◇?? 0011111100111111110001111100011100111111001111110011111110100001111111100011111100111111 3f3fc7c73f3f3fa1fe3f3f
UTF-8 룶끝杷◐룴절◇룶끝 111010111010001110110110111010111000000110011101111001101001110110110111111000101001011110010000111010111010001110110100111011001010000010001000111000101001011110000111111010111010001110110110111010111000000110011101 eba3b6eb819de69db7e29790eba3b4eca088e29787eba3b6eb819d
UHC 룶끝杷◐룴절◇룶끝 100011111010101110110011101000011111011111101101101000101100010010001111101010011100000011111101101000011101111010001111101010111011001110100001 8fabb3a1f7eda2c48fa9c0fda1de8fabb3a1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)