To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 亨??褥ュ?褥ュ?D亨??褥ュ?褥ュ?D^ 10001011100111000011111100111111111001011111000110000011100001010011111111100101111100011000001110000101001111110100010010001011100111000011111100111111111001011111000110000011100001010011111111100101111100011000001110000101001111110100010001011110 8b9c3f3fe5f183853fe5f183853f448b9c3f3fe5f183853fe5f183853f445e
EUC-JP 亨??褥ュ?褥ュ?D亨??褥ュ?褥ュ?D^ 10110101111111000011111100111111111010101111001110100101111001010011111111101010111100111010010111100101001111110100010010110101111111000011111100111111111010101111001110100101111001010011111111101010111100111010010111100101001111110100010001011110 b5fc3f3feaf3a5e53feaf3a5e53f44b5fc3f3feaf3a5e53feaf3a5e53f445e
UTF-8 亨긷윿褥ュ츒褥ュ츒D亨긷윿褥ュ츒褥ュ츒D^ 111001001011101010101000111010101011100010110111111011001001110010111111111010001010010010100101111000111000001110100101111011001011100010010010111010001010010010100101111000111000001110100101111011001011100010010010010001001110010010111010101010001110101010111000101101111110110010011100101111111110100010100100101001011110001110000011101001011110110010111000100100101110100010100100101001011110001110000011101001011110110010111000100100100100010001011110 e4baa8eab8b7ec9cbfe8a4a5e383a5ecb892e8a4a5e383a5ecb89244e4baa8eab8b7ec9cbfe8a4a5e383a5ecb892e8a4a5e383a5ecb892445e
UHC 亨긷윿褥ュ츒褥ュ츒D亨긷윿褥ュ츒褥ュ츒D^ 111110101111101110110001111001011001111110110111111010011011001110101011111001011010111010001101111010011011001110101011111001011010111010001101010001001111101011111011101100011110010110011111101101111110100110110011101010111110010110101110100011011110100110110011101010111110010110101110100011010100010001011110 fafbb1e59fb7e9b3abe5ae8de9b3abe5ae8d44fafbb1e59fb7e9b3abe5ae8de9b3abe5ae8d445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)