To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???kf???k^}Y???kf???k^}bE 00111111001111110011111101101011011001100011111100111111001111110110101101011110011111010101100100111111001111110011111101101011011001100011111100111111001111110110101101011110011111010110001001000101 3f3f3f6b663f3f3f6b5e7d593f3f3f6b663f3f3f6b5e7d6245
SJIS-WIN タ羶kfタ羶k^}Yタ羶kfタ羶k^}bE 110000001110001110111101111100100110111001101011011001101100000011100011101111011111001001101110011010110101111001111101010110011100000011100011101111011111001001101110011010110110011011000000111000111011110111110010011011100110101101011110011111010110001001000101 c0e3bdf26e6b66c0e3bdf26e6b5e7d59c0e3bdf26e6b66c0e3bdf26e6b5e7d6245
EUC-JP タ羶?kfタ羶?k^}Yタ羶?kfタ羶?k^}bE 100011101100000011100110101111110011111101101011011001101000111011000000111001101011111100111111011010110101111001111101010110011000111011000000111001101011111100111111011010110110011010001110110000001110011010111111001111110110101101011110011111010110001001000101 8ec0e6bf3f6b668ec0e6bf3f6b5e7d598ec0e6bf3f6b668ec0e6bf3f6b5e7d6245
UTF-8 タ羶kfタ羶k^}Yタ羶kfタ羶k^}bE 11101111101111101000000011100111101111101011011011101110100001101010011001101011011001101110111110111110100000001110011110111110101101101110111010000110101001100110101101011110011111010101100111101111101111101000000011100111101111101011011011101110100001101010011001101011011001101110111110111110100000001110011110111110101101101110111010000110101001100110101101011110011111010110001001000101 efbe80e7beb6ee86a66b66efbe80e7beb6ee86a66b5e7d59efbe80e7beb6ee86a66b66efbe80e7beb6ee86a66b5e7d6245
UHC ???kf???k^}Y???kf???k^}bE 00111111001111110011111101101011011001100011111100111111001111110110101101011110011111010101100100111111001111110011111101101011011001100011111100111111001111110110101101011110011111010110001001000101 3f3f3f6b663f3f3f6b5e7d593f3f3f6b663f3f3f6b5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)