To which bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 町?郵?臍??爰?佇C町?郵?臍??爰?佇CB 1001001010101100001111111001011101011000001111111110010001100000001111110011111111100000101001110011111110011000110010001000001001100010100100101010110000111111100101110101100000111111111001000110000000111111001111111110000010100111001111111001100011001000100000100110001001000010 92ac3f97583fe4603f3fe0a73f98c8826292ac3f97583fe4603f3fe0a73f98c8826242
EUC-JP 町?郵?臍??爰?佇C町?郵?臍??爰?佇CB 1100010010101110001111111100110110111001001111111110011111000001001111110011111111100000101010010011111111010000110010101010001111000011110001001010111000111111110011011011100100111111111001111100000100111111001111111110000010101001001111111101000011001010101000111100001101000010 c4ae3fcdb93fe7c13f3fe0a93fd0caa3c3c4ae3fcdb93fe7c13f3fe0a93fd0caa3c342
UTF-8 町렊郵렮臍잴혈爰렪佇C町렊郵렮臍잴혈爰렪佇CB 11100111100101001011101011101011101000001000101011101001100000111011010111101011101000001010111011101000100001111000110111101100100111101011010011101101100110001000100011100111100010001011000011101011101000001010101011100100101111011000011111101111101111001010001111100111100101001011101011101011101000001000101011101001100000111011010111101011101000001010111011101000100001111000110111101100100111101011010011101101100110001000100011100111100010001011000011101011101000001010101011100100101111011000011111101111101111001010001101000010 e794baeba08ae983b5eba0aee8878dec9eb4ed9888e788b0eba0aae4bd87efbca3e794baeba08ae983b5eba0aee8878dec9eb4ed9888e788b0eba0aae4bd87efbca342
UHC 町렊郵렮臍잴혈爰렪佇C町렊郵렮臍잴혈爰렪佇CB 111011111110101110001110101000011110100111101000100011101011101111110000101100001100000011101010110001111111011111101010101110101000111010111000111011101011011110100011110000111110111111101011100011101010000111101001111010001000111010111011111100001011000011000000111010101100011111110111111010101011101010001110101110001110111010110111101000111100001101000010 efeb8ea1e9e88ebbf0b0c0eac7f7eaba8eb8eeb7a3c3efeb8ea1e9e88ebbf0b0c0eac7f7eaba8eb8eeb7a3c342

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
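
The same kind of conversion can be sketched in Python. This is a minimal illustration, not the code behind this page. The names CHARSETS, encode_row, and encode_table are hypothetical, and the mapping from the labels above to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row above shows.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("町B").items():
        print(label, bits, hex_string)
```

For example, encode_row("B", "iso-8859-1") gives ("01000010", "42"), matching the final byte of every row in the table.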