To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????}???????????{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 趙???蹄???澱私袞}趙???蹄???澱私袞{^ 1110011011100010001111110011111100111111100100101111101100111111001111110011111110010011011000101000111010000100111001011100111101111101111001101110001000111111001111110011111110010010111110110011111100111111001111111001001101100010100011101000010011100101110011110111101101011110 e6e23f3f3f92fb3f3f3f93628e84e5cf7de6e23f3f3f92fb3f3f3f93628e84e5cf7b5e
EUC-JP 趙???蹄???澱私袞}趙???蹄???澱私袞{^ 1110110011100100001111110011111100111111110001001111110100111111001111110011111111000101110000111011101111100100111010101101000101111101111011001110010000111111001111110011111111000100111111010011111100111111001111111100010111000011101110111110010011101010110100010111101101011110 ece43f3f3fc4fd3f3f3fc5c3bbe4ead17dece43f3f3fc4fd3f3f3fc5c3bbe4ead17b5e
UTF-8 趙얹렰렚蹄ㆁ렰렗澱私袞}趙얹렰렚蹄ㆁ렰렗澱私袞{^ 111010001011011010011001111011001001011010111001111010111010000010110000111010111010000010011010111010001011100110000100111000111000011010000001111010111010000010110000111010111010000010010111111001101011111010110001111001111010011110000001111010001010001010011110011111011110100010110110100110011110110010010110101110011110101110100000101100001110101110100000100110101110100010111001100001001110001110000110100000011110101110100000101100001110101110100000100101111110011010111110101100011110011110100111100000011110100010100010100111100111101101011110 e8b699ec96b9eba0b0eba09ae8b984e38681eba0b0eba097e6beb1e7a781e8a29e7de8b699ec96b9eba0b0eba09ae8b984e38681eba0b0eba097e6beb1e7a781e8a29e7b5e
UHC 趙얹렰렚蹄ㆁ렰렗澱私袞}趙얹렰렚蹄ㆁ렰렗澱私袞{^ 1111000011100001101111101111000110001110101111011000111010101101111100001011010010100100111100011000111010111101100011101010110011101110111111101101111011100111110011011110010101111101111100001110000110111110111100011000111010111101100011101010110111110000101101001010010011110001100011101011110110001110101011001110111011111110110111101110011111001101111001010111101101011110 f0e1bef18ebd8eadf0b4a4f18ebd8eaceefedee7cde57df0e1bef18ebd8eadf0b4a4f18ebd8eaceefedee7cde57b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)