To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 亨??褥ュ?褥ュ?W}亨??褥ュ?褥ュ?W{^ 100010111001110000111111001111111110010111110001100000111000010100111111111001011111000110000011100001010011111101010111011111011000101110011100001111110011111111100101111100011000001110000101001111111110010111110001100000111000010100111111010101110111101101011110 8b9c3f3fe5f183853fe5f183853f577d8b9c3f3fe5f183853fe5f183853f577b5e
EUC-JP 亨??褥ュ?褥ュ?W}亨??褥ュ?褥ュ?W{^ 101101011111110000111111001111111110101011110011101001011110010100111111111010101111001110100101111001010011111101010111011111011011010111111100001111110011111111101010111100111010010111100101001111111110101011110011101001011110010100111111010101110111101101011110 b5fc3f3feaf3a5e53feaf3a5e53f577db5fc3f3feaf3a5e53feaf3a5e53f577b5e
UTF-8 亨긷윿褥ュ츒褥ュ츒W}亨긷윿褥ュ츒褥ュ츒W{^ 1110010010111010101010001110101010111000101101111110110010011100101111111110100010100100101001011110001110000011101001011110110010111000100100101110100010100100101001011110001110000011101001011110110010111000100100100101011101111101111001001011101010101000111010101011100010110111111011001001110010111111111010001010010010100101111000111000001110100101111011001011100010010010111010001010010010100101111000111000001110100101111011001011100010010010010101110111101101011110 e4baa8eab8b7ec9cbfe8a4a5e383a5ecb892e8a4a5e383a5ecb892577de4baa8eab8b7ec9cbfe8a4a5e383a5ecb892e8a4a5e383a5ecb892577b5e
UHC 亨긷윿褥ュ츒褥ュ츒W}亨긷윿褥ュ츒褥ュ츒W{^ 1111101011111011101100011110010110011111101101111110100110110011101010111110010110101110100011011110100110110011101010111110010110101110100011010101011101111101111110101111101110110001111001011001111110110111111010011011001110101011111001011010111010001101111010011011001110101011111001011010111010001101010101110111101101011110 fafbb1e59fb7e9b3abe5ae8de9b3abe5ae8d577dfafbb1e59fb7e9b3abe5ae8de9b3abe5ae8d577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)