To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 仲弄???嶝??云??寃??嶝??云肌 100100101000011110011000010011010011111100111111001111111001101111010001001111110011111110001001010111010011111100111111100110111000001100111111001111111001101111010001001111110011111110001001010111011001010010100111 9287984d3f3f3f9bd13f3f895d3f3f9b833f3f9bd13f3f895d94a7
EUC-JP 仲弄???嶝??云??寃??嶝??云肌 110000111110011111001111101011100011111100111111001111111101011011010011001111110011111110110001101111100011111100111111110101011110001100111111001111111101011011010011001111110011111110110001101111101100100010101001 c3e7cfae3f3f3fd6d33f3fb1be3f3fd5e33f3fd6d33f3fb1bec8a9
UTF-8 仲弄렟꿩렯嶝렰렯云陋뇐寃꿩렯嶝렰렯云肌 111001001011101110110010111001011011110010000100111010111010000010011111111010101011111110101001111010111010000010101111111001011011011010011101111010111010000010110000111010111010000010101111111001001011101010010001111011111010010110010001111010111000011110010000111001011010111110000011111010101011111110101001111010111010000010101111111001011011011010011101111010111010000010110000111010111010000010101111111001001011101010010001111010001000001010001100 e4bbb2e5bc84eba09feabfa9eba0afe5b69deba0b0eba0afe4ba91efa591eb8790e5af83eabfa9eba0afe5b69deba0b0eba0afe4ba91e8828c
UHC 仲弄렟꿩렯嶝렰렯云陋뇐寃꿩렯嶝렰렯云肌 1111000111101010110101101110011110001110101100001011001011100110100011101011110011010100111100011000111010111101100011101011110011101001111101101101001011101011101100111111101111101010101100101011001011100110100011101011110011010100111100011000111010111101100011101011110011101001111101101101000110111111 f1ead6e78eb0b2e68ebcd4f18ebd8ebce9f6d2ebb3fbeab2b2e68ebcd4f18ebd8ebce9f6d1bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)