What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 厓ら????繹??v厓ら????繹??vB 111110101000110110000010111001110011111100111111001111110011111111100011100010000011111100111111011101101111101010001101100000101110011100111111001111110011111100111111111000111000100000111111001111110111011001000010 fa8d82e73f3f3f3fe3883f3f76fa8d82e73f3f3f3fe3883f3f7642
EUC-JP 厓ら????繹??v厓ら????繹??vB 1000111110110100110001111010010011101001001111110011111100111111001111111110010111101000001111110011111101110110100011111011010011000111101001001110100100111111001111110011111100111111111001011110100000111111001111110111011001000010 8fb4c7a4e93f3f3f3fe5e83f3f768fb4c7a4e93f3f3f3fe5e83f3f7642
UTF-8 厓ら솈溜볥젌繹먮젪v厓ら솈溜볥젌繹먮젪vB 111001011000111010010011111000111000001010001001111011001000011010001000111011111010011110001011111010111011001110100101111011001010000010001100111001111011100110111001111010111010100010101110111011001010000010101010011101101110010110001110100100111110001110000010100010011110110010000110100010001110111110100111100010111110101110110011101001011110110010100000100011001110011110111001101110011110101110101000101011101110110010100000101010100111011001000010 e58e93e38289ec8688efa78bebb3a5eca08ce7b9b9eba8aeeca0aa76e58e93e38289ec8688efa78bebb3a5eca08ce7b9b9eba8aeeca0aa7642
UHC 厓ら솈溜볥젌繹먮젪v厓ら솈溜볥젌繹먮젪vB 111001001110110110101010111010011001100110001100111010101111111010010011111010111010000010001101111001101011101010010000111010111010000010100010011101101110010011101101101010101110100110011001100011001110101011111110100100111110101110100000100011011110011010111010100100001110101110100000101000100111011001000010 e4edaae9998ceafe93eba08de6ba90eba0a276e4edaae9998ceafe93eba08de6ba90eba0a27642

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (3f) in the table above.
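
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name to_bit_strings and the mapping from the table's charset labels to Python codec names are assumptions made for illustration. In particular, Python's cp932 and cp949 codecs are assumed to be close equivalents of SJIS-WIN and UHC, and they may differ from this tool for some characters. Unencodable characters are replaced with "?" (3f), as in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# cp932 ~ SJIS-WIN and cp949 ~ UHC are assumptions; edge cases may differ.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "vB"  # the last two characters of each row in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the ASCII input "vB", every charset listed yields 7642 in hexadecimal, which matches the final bytes of each row in the table. Non-ASCII input is where the charsets diverge.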