What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 垓?楷???垓?楷??拒垓?楷???垓?楷??居^ 1001101010110100001111111001111010110010001111110011111100111111100110101011010000111111100111101011001000111111001111111000101110010001100110101011010000111111100111101011001000111111001111110011111110011010101101000011111110011110101100100011111100111111100010111000111101011110 9ab43f9eb23f3f3f9ab43f9eb23f3f8b919ab43f9eb23f3f3f9ab43f9eb23f3f8b8f5e
EUC-JP 垓?楷???垓?楷??拒垓?楷???垓?楷??居^ 1101010010110110001111111101110010110100001111110011111100111111110101001011011000111111110111001011010000111111001111111011010111110001110101001011011000111111110111001011010000111111001111110011111111010100101101100011111111011100101101000011111100111111101101011110111101011110 d4b63fdcb43f3f3fd4b63fdcb43f3fb5f1d4b63fdcb43f3f3fd4b63fdcb43f3fb5ef5e
UTF-8 垓렡楷곈렕렧垓렡楷곈렠拒垓렡楷곈렕렧垓렡楷곈렠居^ 11100101100111101001001111101011101000001010000111100110101001011011011111101010101100111000100011101011101000001001010111101011101000001010011111100101100111101001001111101011101000001010000111100110101001011011011111101010101100111000100011101011101000001010000011100110100010111001001011100101100111101001001111101011101000001010000111100110101001011011011111101010101100111000100011101011101000001001010111101011101000001010011111100101100111101001001111101011101000001010000111100110101001011011011111101010101100111000100011101011101000001010000011100101101100011000010101011110 e59e93eba0a1e6a5b7eab388eba095eba0a7e59e93eba0a1e6a5b7eab388eba0a0e68b92e59e93eba0a1e6a5b7eab388eba095eba0a7e59e93eba0a1e6a5b7eab388eba0a0e5b1855e
UHC 垓렡楷곈렕렧垓렡楷곈렠拒垓렡楷곈렕렧垓렡楷곈렠居^ 11111010101001111000111010110010111110101010110010110000111010011000111010101010100011101011011011111010101001111000111010110010111110101010110010110000111010011000111010110001110010111101111011111010101001111000111010110010111110101010110010110000111010011000111010101010100011101011011011111010101001111000111010110010111110101010110010110000111010011000111010110001110010111101110001011110 faa78eb2faacb0e98eaa8eb6faa78eb2faacb0e98eb1cbdefaa78eb2faacb0e98eaa8eb6faa78eb2faacb0e98eb1cbdc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
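
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to be what the ISO-8859-1 row above shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("居^").items():
        print(name, binary, hexadecimal)
```

Because ASCII characters such as `^` map to the same single byte (`5e`) in all five charsets, every row in the table ends with the same eight bits; only the non-ASCII characters differ between encodings.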