What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 趙?咳憤趙?咳奮N}趙?咳憤趙?咳奮N{^ 111001101110001000111111100010100101000010010101101011101110011011100010001111111000101001010000100101011011000101001110011111011110011011100010001111111000101001010000100101011010111011100110111000100011111110001010010100001001010110110001010011100111101101011110 e6e23f8a5095aee6e23f8a5095b14e7de6e23f8a5095aee6e23f8a5095b14e7b5e
EUC-JP 趙?咳憤趙?咳奮N}趙?咳憤趙?咳奮N{^ 111011001110010000111111101100111011000111001010101100001110110011100100001111111011001110110001110010101011001101001110011111011110110011100100001111111011001110110001110010101011000011101100111001000011111110110011101100011100101010110011010011100111101101011110 ece43fb3b1cab0ece43fb3b1cab34e7dece43fb3b1cab0ece43fb3b1cab34e7b5e
UTF-8 趙렡咳憤趙렡咳奮N}趙렡咳憤趙렡咳奮N{^ 1110100010110110100110011110101110100000101000011110010110010010101100111110011010000110101001001110100010110110100110011110101110100000101000011110010110010010101100111110010110100101101011100100111001111101111010001011011010011001111010111010000010100001111001011001001010110011111001101000011010100100111010001011011010011001111010111010000010100001111001011001001010110011111001011010010110101110010011100111101101011110 e8b699eba0a1e592b3e686a4e8b699eba0a1e592b3e5a5ae4e7de8b699eba0a1e592b3e686a4e8b699eba0a1e592b3e5a5ae4e7b5e
UHC 趙렡咳憤趙렡咳奮N}趙렡咳憤趙렡咳奮N{^ 11110000111000011000111010110010111110101010011011011101110010011111000011100001100011101011001011111010101001101101110111000111010011100111110111110000111000011000111010110010111110101010011011011101110010011111000011100001100011101011001011111010101001101101110111000111010011100111101101011110 f0e18eb2faa6ddc9f0e18eb2faa6ddc74e7df0e18eb2faa6ddc9f0e18eb2faa6ddc74e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
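
The conversion shown in the table can be approximated with a short Python sketch. It is not the converter's actual implementation: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper name `bit_strings` are assumptions chosen for illustration, and Python's mapping tables may differ from the converter's for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Python codec names assumed to correspond to the charsets in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters
    # that the target charset cannot represent.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings("趙N}", codec)
        print(label, binary, hexadecimal)
```

For example, `bit_strings("N}", "utf-8")` returns `("0100111001111101", "4e7d")`, the same bytes that end each hexadecimal string before the final `{^` in the table.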