To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 茫√1鐔削州經э蹴v茫√1鐔削州經э蹴vB 111001001010100110000001111000111000001001010000111010000101110010001101111011011000111101000010111000110101001110000100100011111000111101010010011101101110010010101001100000011110001110000010010100001110100001011100100011011110110110001111010000101110001101010011100001001000111110001111010100100111011001000010 e4a981e38250e85c8ded8f42e353848f8f5276e4a981e38250e85c8ded8f42e353848f8f527642
EUC-JP 茫√1鐔削州經э蹴v茫√1鐔削州經э蹴vB 111010001010101110100010111001011010001110110001111011111011110110111010111011111011110110100011111001011011010010100111111011111011110110110011011101101110100010101011101000101110010110100011101100011110111110111101101110101110111110111101101000111110010110110100101001111110111110111101101100110111011001000010 e8aba2e5a3b1efbdbaefbda3e5b4a7efbdb376e8aba2e5a3b1efbdbaefbda3e5b4a7efbdb37642
UTF-8 茫√1鐔削州經э蹴v茫√1鐔削州經э蹴vB 11101000100011001010101111100010100010001001101011101111101111001001000111101001100100001001010011100101100010011000101011100101101101111001111011100111101101101001001111010001100011011110100010111001101101000111011011101000100011001010101111100010100010001001101011101111101111001001000111101001100100001001010011100101100010011000101011100101101101111001111011100111101101101001001111010001100011011110100010111001101101000111011001000010 e88cabe2889aefbc91e99094e5898ae5b79ee7b693d18de8b9b476e88cabe2889aefbc91e99094e5898ae5b79ee7b693d18de8b9b47642
UHC 茫√1?削州經э蹴v茫√1?削州經э蹴vB 11011000110101001010000111101110101000111011000100111111110111101111101111110001101101101100110011101000101011001110111111110101111011010111011011011000110101001010000111101110101000111011000100111111110111101111101111110001101101101100110011101000101011001110111111110101111011010111011001000010 d8d4a1eea3b13fdefbf1b6cce8aceff5ed76d8d4a1eea3b13fdefbf1b6cce8aceff5ed7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)