To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 凹ε????業??v凹ε????業??vB 100010011001101010000011110000110011111100111111001111110011111110001011110001100011111100111111011101101000100110011010100000111100001100111111001111110011111100111111100010111100011000111111001111110111011001000010 899a83c33f3f3f3f8bc63f3f76899a83c33f3f3f3f8bc63f3f7642
EUC-JP 凹ε?獒??業??v凹ε?獒??業??vB 10110001111110101010011011000101001111111000111111001011101110110011111100111111101101101100100000111111001111110111011010110001111110101010011011000101001111111000111111001011101110110011111100111111101101101100100000111111001111110111011001000010 b1faa6c53f8fcbbb3f3fb6c83f3f76b1faa6c53f8fcbbb3f3fb6c83f3f7642
UTF-8 凹ε쳣獒붻콐業뺡엱v凹ε쳣獒붻콐業뺡엱vB 11100101100001111011100111001110101101011110110010110011101000111110011110001101100100101110101110110110101110111110110010111101100100001110011010100101101011011110101110111010101000011110110010010111101100010111011011100101100001111011100111001110101101011110110010110011101000111110011110001101100100101110101110110110101110111110110010111101100100001110011010100101101011011110101110111010101000011110110010010111101100010111011001000010 e587b9ceb5ecb3a3e78d92ebb6bbecbd90e6a5adebbaa1ec97b176e587b9ceb5ecb3a3e78d92ebb6bbecbd90e6a5adebbaa1ec97b17642
UHC 凹ε쳣獒붻콐業뺡엱v凹ε쳣獒붻콐業뺡엱vB 111010001110101010100101111001011010101110001001111010001010001110010100111010001011000110001100111001011111011010010101111010011001111010000110011101101110100011101010101001011110010110101011100010011110100010100011100101001110100010110001100011001110010111110110100101011110100110011110100001100111011001000010 e8eaa5e5ab89e8a394e8b18ce5f695e99e8676e8eaa5e5ab89e8a394e8b18ce5f695e99e867642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)