To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 絶??節??菴у?v絶??節??菴у?vB 1001000011100010001111110011111110010000110111110011111100111111111001001011110110000100100001010011111101110110100100001110001000111111001111111001000011011111001111110011111111100100101111011000010010000101001111110111011001000010 90e23f3f90df3f3fe4bd84853f7690e23f3f90df3f3fe4bd84853f7642
EUC-JP 絶??節??菴у?v絶??節??菴у?vB 1100000011100100001111110011111111000000111000010011111100111111111010001011111110100111111001010011111101110110110000001110010000111111001111111100000011100001001111110011111111101000101111111010011111100101001111110111011001000010 c0e43f3fc0e13f3fe8bfa7e53f76c0e43f3fc0e13f3fe8bfa7e53f7642
UTF-8 絶껅툒節놂푵菴у눀v絶껅툒節놂푵菴у눀vB 11100111101101011011011011101010101110111000010111101101100010001001001011100111101011111000000011101011100001101000001011101101100100011011010111101000100011111011010011010001100000111110101110001000100000000111011011100111101101011011011011101010101110111000010111101101100010001001001011100111101011111000000011101011100001101000001011101101100100011011010111101000100011111011010011010001100000111110101110001000100000000111011001000010 e7b5b6eabb85ed8892e7af80eb8682ed91b5e88fb4d183eb888076e7b5b6eabb85ed8892e7af80eb8682ed91b5e88fb4d183eb88807642
UHC 絶껅툒節놂푵菴у눀v絶껅툒節놂푵菴у눀vB 111011111011111010000011111001101011100010001001111011111011110110110011111011111011111010000011111001001110000010101100111001011000011110100001011101101110111110111110100000111110011010111000100010011110111110111101101100111110111110111110100000111110010011100000101011001110010110000111101000010111011001000010 efbe83e6b889efbdb3efbe83e4e0ace587a176efbe83e6b889efbdb3efbe83e4e0ace587a17642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)