To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????n}???????????n{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 淨???遵?貞?┗助?n}淨???遵?貞?┗助?n{^ 10011111110001000011111100111111001111111000111110000101001111111001001011100101001111111000010010101111100011111001010100111111011011100111110110011111110001000011111100111111001111111000111110000101001111111001001011100101001111111000010010101111100011111001010100111111011011100111101101011110 9fc43f3f3f8f853f92e53f84af8f953f6e7d9fc43f3f3f8f853f92e53f84af8f953f6e7b5e
EUC-JP 淨???遵?貞?┗助?n}淨???遵?貞?┗助?n{^ 11011110110001100011111100111111001111111011110111100101001111111100010011100111001111111010100010110001101111011111010100111111011011100111110111011110110001100011111100111111001111111011110111100101001111111100010011100111001111111010100010110001101111011111010100111111011011100111101101011110 dec63f3f3fbde53fc4e73fa8b1bdf53f6e7ddec63f3f3fbde53fc4e73fa8b1bdf53f6e7b5e
UTF-8 淨렞渽렜遵렗貞흐┗助렞n}淨렞渽렜遵렗貞흐┗助렞n{^ 1110011010110111101010001110101110100000100111101110011010111000101111011110101110100000100111001110100110000001101101011110101110100000100101111110100010110010100111101110110110011101100100001110001010010100100101111110010110001010101010011110101110100000100111100110111001111101111001101011011110101000111010111010000010011110111001101011100010111101111010111010000010011100111010011000000110110101111010111010000010010111111010001011001010011110111011011001110110010000111000101001010010010111111001011000101010101001111010111010000010011110011011100111101101011110 e6b7a8eba09ee6b8bdeba09ce981b5eba097e8b29eed9d90e29497e58aa9eba09e6e7de6b7a8eba09ee6b8bdeba09ce981b5eba097e8b29eed9d90e29497e58aa9eba09e6e7b5e
UHC 淨렞渽렜遵렗貞흐┗助렞n}淨렞渽렜遵렗貞흐┗助렞n{^ 11101111111001001000111010101111111011101010101010001110101011101111000111100101100011101010110011101111111101101100100011100101101001101011000111110000101111101000111010101111011011100111110111101111111001001000111010101111111011101010101010001110101011101111000111100101100011101010110011101111111101101100100011100101101001101011000111110000101111101000111010101111011011100111101101011110 efe48eafeeaa8eaef1e58eaceff6c8e5a6b1f0be8eaf6e7defe48eafeeaa8eaef1e58eaceff6c8e5a6b1f0be8eaf6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)