To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 壕?楷??槃斌楷??模壕?楷??槃斌楷??模^ 1000110110001000001111111001111010110010001111110011111110011110110011111001010101101011100111101011001000111111001111111001011011001101100011011000100000111111100111101011001000111111001111111001111011001111100101010110101110011110101100100011111100111111100101101100110101011110 8d883f9eb23f3f9ecf956b9eb23f3f96cd8d883f9eb23f3f9ecf956b9eb23f3f96cd5e
EUC-JP 壕?楷?濩槃斌楷?濩模壕?楷?濩槃斌楷?濩模^ 10111001111010000011111111011100101101000011111110001111110010011010010011011100110100011100100111001100110111001011010000111111100011111100100110100100110011001100111110111001111010000011111111011100101101000011111110001111110010011010010011011100110100011100100111001100110111001011010000111111100011111100100110100100110011001100111101011110 b9e83fdcb43f8fc9a4dcd1c9ccdcb43f8fc9a4cccfb9e83fdcb43f8fc9a4dcd1c9ccdcb43f8fc9a4cccf5e
UTF-8 壕렜楷렠濩槃斌楷렠濩模壕렜楷렠濩槃斌楷렠濩模^ 11100101101000111001010111101011101000001001110011100110101001011011011111101011101000001010000011100110101111111010100111100110101001111000001111100110100101101000110011100110101001011011011111101011101000001010000011100110101111111010100111100110101010001010000111100101101000111001010111101011101000001001110011100110101001011011011111101011101000001010000011100110101111111010100111100110101001111000001111100110100101101000110011100110101001011011011111101011101000001010000011100110101111111010100111100110101010001010000101011110 e5a395eba09ce6a5b7eba0a0e6bfa9e6a783e6968ce6a5b7eba0a0e6bfa9e6a8a1e5a395eba09ce6a5b7eba0a0e6bfa9e6a783e6968ce6a5b7eba0a0e6bfa9e6a8a15e
UHC 壕렜楷렠濩槃斌楷렠濩模壕렜楷렠濩槃斌楷렠濩模^ 111110111011110110001110101011101111101010101100100011101011000111111011110011011101101011101001110111101011000011111010101011001000111010110001111110111100110111011001101111001111101110111101100011101010111011111010101011001000111010110001111110111100110111011010111010011101111010110000111110101010110010001110101100011111101111001101110110011011110001011110 fbbd8eaefaac8eb1fbcddae9deb0faac8eb1fbcdd9bcfbbd8eaefaac8eb1fbcddae9deb0faac8eb1fbcdd9bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)