To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 如???ф?臾??[如???ф?臾??[^ 100101000100000000111111001111110011111110000100100001100011111111100100011010110011111100111111010110111001010001000000001111110011111100111111100001001000011000111111111001000110101100111111001111110101101101011110 94403f3f3f84863fe46b3f3f5b94403f3f3f84863fe46b3f3f5b5e
EUC-JP 如???ф?臾??[如???ф?臾??[^ 110001111010000100111111001111110011111110100111111001100011111111100111110011000011111100111111010110111100011110100001001111110011111100111111101001111110011000111111111001111100110000111111001111110101101101011110 c7a13f3f3fa7e63fe7cc3f3f5bc7a13f3f3fa7e63fe7cc3f3f5b5e
UTF-8 如붾쵈杻ф윍臾딅뼩[如붾쵈杻ф윍臾딅뼩[^ 11100101101001101000001011101011101101101011111011101100101101011000100011101111101001111000100011010001100001001110110010011100100011011110100010000111101111101110101110010100100001011110101110111100101010010101101111100101101001101000001011101011101101101011111011101100101101011000100011101111101001111000100011010001100001001110110010011100100011011110100010000111101111101110101110010100100001011110101110111100101010010101101101011110 e5a682ebb6beecb588efa788d184ec9c8de887beeb9485ebbca95be5a682ebb6beecb588efa788d184ec9c8de887beeb9485ebbca95b5e
UHC 如붾쵈杻ф윍臾딅뼩[如붾쵈杻ф윍臾딅뼩[^ 111001011111110110010100111010111010110010001010111010101111010010101100111001101001111110010100111010111010110010001010111010111001011010101100010110111110010111111101100101001110101110101100100010101110101011110100101011001110011010011111100101001110101110101100100010101110101110010110101011000101101101011110 e5fd94ebac8aeaf4ace69f94ebac8aeb96ac5be5fd94ebac8aeaf4ace69f94ebac8aeb96ac5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)