To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 茫√1郤削州箜э蹴v茫√1郤削州箜э蹴vB 111001001010100110000001111000111000001001010000111001111011101010001101111011011000111101000010111000101011000110000100100011111000111101010010011101101110010010101001100000011110001110000010010100001110011110111010100011011110110110001111010000101110001010110001100001001000111110001111010100100111011001000010 e4a981e38250e7ba8ded8f42e2b1848f8f5276e4a981e38250e7ba8ded8f42e2b1848f8f527642
EUC-JP 茫√1郤削州箜э蹴v茫√1郤削州箜э蹴vB 111010001010101110100010111001011010001110110001111011101011110010111010111011111011110110100011111001001011001110100111111011111011110110110011011101101110100010101011101000101110010110100011101100011110111010111100101110101110111110111101101000111110010010110011101001111110111110111101101100110111011001000010 e8aba2e5a3b1eebcbaefbda3e4b3a7efbdb376e8aba2e5a3b1eebcbaefbda3e4b3a7efbdb37642
UTF-8 茫√1郤削州箜э蹴v茫√1郤削州箜э蹴vB 11101000100011001010101111100010100010001001101011101111101111001001000111101001100000111010010011100101100010011000101011100101101101111001111011100111101011101001110011010001100011011110100010111001101101000111011011101000100011001010101111100010100010001001101011101111101111001001000111101001100000111010010011100101100010011000101011100101101101111001111011100111101011101001110011010001100011011110100010111001101101000111011001000010 e88cabe2889aefbc91e983a4e5898ae5b79ee7ae9cd18de8b9b476e88cabe2889aefbc91e983a4e5898ae5b79ee7ae9cd18de8b9b47642
UHC 茫√1?削州?э蹴v茫√1?削州?э蹴vB 1101100011010100101000011110111010100011101100010011111111011110111110111111000110110110001111111010110011101111111101011110110101110110110110001101010010100001111011101010001110110001001111111101111011111011111100011011011000111111101011001110111111110101111011010111011001000010 d8d4a1eea3b13fdefbf1b63faceff5ed76d8d4a1eea3b13fdefbf1b63faceff5ed7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)