To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??揄??巍リ????誘?????弛 00111111001111110011111111100010100001100011111100111111100111011000100100111111001111111001101111011001100000111000101000111111001111110011111100111111100101110101010100111111001111110011111100111111001111111001001001101111 3f3f3fe2863f3f9d893f3f9bd9838a3f3f3f3f97553f3f3f3f3f926f
EUC-JP ???竊??揄??巍リ????誘??孼??弛 001111110011111100111111111000111110011000111111001111111101100111101001001111110011111111010110110110111010010111101010001111110011111100111111001111111100110110110110001111110011111110001111101110101100001100111111001111111100001111010000 3f3f3fe3e63f3fd9e93f3fd6dba5ea3f3f3f3fcdb63f3f8fbac33f3fc3d0
UTF-8 捻뀁뮆竊섋린揄욱맪巍リ랜鱗꿨짃誘⑸쐝孼대씛弛 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001011111010111010011010110000111001101000111110000100111011001001101010110001111010111010011110101010111001011011011110001101111000111000001110101010111010111001111010011100111011111010011110110010111010101011111110101000111011001010011110000011111010001010101010011000111000101001000110111000111011001001000010011101111001011010110110111100111010111000110010000000111011001001010010011011111001011011110010011011 efa6a4eb8081ebae86e7ab8aec848beba6b0e68f84ec9ab1eba7aae5b78de383aaeb9e9cefa7b2eabfa8eca783e8aa98e291b8ec909de5adbceb8c80ec949be5bc9b
UHC 捻뀁뮆竊섋린揄욱맪巍リ랜鱗꿨짃誘⑸쐝孼대씛弛 1110011011110111101100101110110010010010100101011110111110111100100110001110100010111000101100001110101011110001101111111110110110010000101100101110100011100100101010111110101010110111101000111110110011100111101100101110010110100011100100111110101110101111101010011110101110011100100000111110010111101101101101001110101110011101101100001110110010101100 e6f7b2ec9295efbc98e8b8b0eaf1bfed90b2e8e4abeab7a3ece7b2e5a393ebafa9eb9c83e5edb4eb9db0ecac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)