To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?西????德????□?潔?絞東????^ 00111111100100001011110000111111001111110011111100111111111110101011101000111111001111110011111100111111100000011010000000111111100011001000100100111111100011010110100110010011100011000011111100111111001111110011111101011110 3f90bc3f3f3f3ffaba3f3f3f3f81a03f8c893f8d69938c3f3f3f3f5e
EUC-JP ?西?????????□?潔?絞東????^ 001111111100000010111110001111110011111100111111001111110011111100111111001111110011111100111111101000101010001000111111101101111110100100111111101110011100101011000101111011000011111100111111001111110011111101011110 3fc0be3f3f3f3f3f3f3f3f3fa2a23fb7e93fb9cac5ec3f3f3f3f5e
UTF-8 렊西롆쒔롉뤦德찊첸춲첁□쨴潔춲絞東렔렻쒔렕^ 11101011101000001000101011101000101001011011111111101011101000011000011011101100100100101001010011101011101000011000100111101011101001001010011011100101101111101011011111101100101100001000101011101100101100101011100011101100101101101011001011101100101100101000000111100010100101101010000111101100101010001011010011100110101111011001010011101100101101101011001011100111101101011001111011100110100111011011000111101011101000001001010011101011101000001011101111101100100100101001010011101011101000001001010101011110 eba08ae8a5bfeba186ec9294eba189eba4a6e5beb7ecb08aecb2b8ecb6b2ecb281e296a1eca8b4e6bd94ecb6b2e7b59ee69db1eba094eba0bbec9294eba0955e
UHC 렊西롆쒔롉뤦德찊첸춲첁□쨴潔춲絞東렔렻쒔렕^ 10001110101000011110000010100100100011101100110010111110101011011000111011001111100011111101010011010011111011001010100110001110110000111011111010101101100011101010101010001110101000011110000010100100100011101100110010111110101011011000111011001110111011011101010011010100100011101010100110001110110000111011111010101101100011101010101001011110 8ea1e0a48eccbead8ecf8fd4d3eca98ec3bead8eaa8ea1e0a48eccbead8eceedd4d48ea98ec3bead8eaa5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)