To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN ??擁衙??擁衙^ 00111111001111111001011101101001111001011100100100111111001111111001011101101001111001011100100101011110 3f3f9769e5c93f3f9769e5c95e
EUC-JP ??擁衙??擁衙^ 00111111001111111100110111001010111010101100101100111111001111111100110111001010111010101100101101011110 3f3fcdcaeacb3f3fcdcaeacb5e
UTF-8 솜셈擁衙솜셈擁衙^ 11101100100001101001110011101100100001011000100011100110100100111000000111101000101000011001100111101100100001101001110011101100100001011000100011100110100100111000000111101000101000011001100101011110 ec869cec8588e69381e8a199ec869cec8588e69381e8a1995e
UHC 솜셈擁衙솜셈擁衙^ 1011110011011000101111001100000011101000101101101110010010110111101111001101100010111100110000001110100010110110111001001011011101011110 bcd8bcc0e8b6e4b7bcd8bcc0e8b6e4b75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)