What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???有??蟻??v???有??蟻??vB 00111111001111110011111110010111010011000011111100111111100010110110000100111111001111110111011000111111001111110011111110010111010011000011111100111111100010110110000100111111001111110111011001000010 3f3f3f974c3f3f8b613f3f763f3f3f974c3f3f8b613f3f7642
EUC-JP ???有??蟻??v???有??蟻??vB 00111111001111110011111111001101101011010011111100111111101101011100001000111111001111110111011000111111001111110011111111001101101011010011111100111111101101011100001000111111001111110111011001000010 3f3f3fcdad3f3fb5c23f3f763f3f3fcdad3f3fb5c23f3f7642
UTF-8 銳㏓맫有며독蟻믩릉v銳㏓맫有며독蟻믩릉vB 111010011000101010110011111000111000111110010011111010111010011110101011111001101001110010001001111010111010100110110000111010111000111110000101111010001001111110111011111010111010111110101001111010111010011010001001011101101110100110001010101100111110001110001111100100111110101110100111101010111110011010011100100010011110101110101001101100001110101110001111100001011110100010011111101110111110101110101111101010011110101110100110100010010111011001000010 e98ab3e38f93eba7abe69c89eba9b0eb8f85e89fbbebafa9eba68976e98ab3e38f93eba7abe69c89eba9b0eb8f85e89fbbebafa9eba6897642
UHC 銳㏓맫有며독蟻믩릉v銳㏓맫有며독蟻믩릉vB 111001111110010110100111111010111001000010110011111010101111001110111000111001111011010110110110111010111111110010010010111010111011100010101010011101101110011111100101101001111110101110010000101100111110101011110011101110001110011110110101101101101110101111111100100100101110101110111000101010100111011001000010 e7e5a7eb90b3eaf3b8e7b5b6ebfc92ebb8aa76e7e5a7eb90b3eaf3b8e7b5b6ebfc92ebb8aa7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
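
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `encode_table` are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("有vB").items():
        print(label, binary, hexadecimal)
```

Calling `encode_table` with a short string returns one (binary, hexadecimal) pair per charset, so the rows can be compared directly against the page's output.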