What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 烝?億???????烝?億???????^ 11100000011111100011111110001001101011010011111100111111001111110011111100111111001111110011111111100000011111100011111110001001101011010011111100111111001111110011111100111111001111110011111101011110 e07e3f89ad3f3f3f3f3f3f3fe07e3f89ad3f3f3f3f3f3f3f5e
EUC-JP 烝?億??泮????烝?億??泮????^ 1101111111011111001111111011001010101111001111110011111110001111110001111010100000111111001111110011111100111111110111111101111100111111101100101010111100111111001111111000111111000111101010000011111100111111001111110011111101011110 dfdf3fb2af3f3f8fc7a83f3f3f3fdfdf3fb2af3f3f8fc7a83f3f3f3f5e
UTF-8 烝렯億계렊泮렗멱렞렢烝렯億계렊泮렗멱렞렗^ 11100111100000111001110111101011101000001010111111100101100001001000010011101010101100111000010011101011101000001000101011100110101100111010111011101011101000001001011111101011101010011011000111101011101000001001111011101011101000001010001011100111100000111001110111101011101000001010111111100101100001001000010011101010101100111000010011101011101000001000101011100110101100111010111011101011101000001001011111101011101010011011000111101011101000001001111011101011101000001001011101011110 e7839deba0afe58484eab384eba08ae6b3aeeba097eba9b1eba09eeba0a2e7839deba0afe58484eab384eba08ae6b3aeeba097eba9b1eba09eeba0975e
UHC 烝렯億계렊泮렗멱렞렢烝렯億계렊泮렗멱렞렗^ 1111000111110110100011101011110011100101111000101011000011101000100011101010000111011010111010101000111010101100101110001110100010001110101011111000111010110011111100011111011010001110101111001110010111100010101100001110100010001110101000011101101011101010100011101010110010111000111010001000111010101111100011101010110001011110 f1f68ebce5e2b0e88ea1daea8eacb8e88eaf8eb3f1f68ebce5e2b0e88ea1daea8eacb8e88eaf8eac5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
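
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical, and the mapping from the table's labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("^").items():
        print(label, binary, hexadecimal)
```

Calling `encode_table` with any short string returns one (binary, hexadecimal) pair per charset, in the same order as the table rows.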