What bit string is a character (or short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????±?????????^ 00111111001111110011111100111111001111110011111100111111001111111011000100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3fb13f3f3f3f3f3f3f3f3f5e
SJIS-WIN ツ咫ツ哭ツ咫ツ慊±ツ咫ツ哭ツ咫ツ慊+^ 1100001010011010010000001100001010011010010011001100001010011010010000001100001010011100110000101000000101111101110000101001101001000000110000101001101001001100110000101001101001000000110000101001110011000010100000010111101101011110 c29a40c29a4cc29a40c29cc2817dc29a40c29a4cc29a40c29cc2817b5e
EUC-JP ツ咫ツ哭ツ咫ツ慊±ツ咫ツ哭ツ咫ツ慊+^ 10001110110000101101001110100001100011101100001011010011101011011000111011000010110100111010000110001110110000101101100011000100101000011101111010001110110000101101001110100001100011101100001011010011101011011000111011000010110100111010000110001110110000101101100011000100101000011101110001011110 8ec2d3a18ec2d3ad8ec2d3a18ec2d8c4a1de8ec2d3a18ec2d3ad8ec2d3a18ec2d8c4a1dc5e
UTF-8 ツ咫ツ哭ツ咫ツ慊±ツ咫ツ哭ツ咫ツ慊+^ 111011111011111010000010111001011001001010101011111011111011111010000010111001011001001110101101111011111011111010000010111001011001001010101011111011111011111010000010111001101000010110001010110000101011000111101111101111101000001011100101100100101010101111101111101111101000001011100101100100111010110111101111101111101000001011100101100100101010101111101111101111101000001011100110100001011000101011101111101111001000101101011110 efbe82e592abefbe82e593adefbe82e592abefbe82e6858ac2b1efbe82e592abefbe82e593adefbe82e592abefbe82e6858aefbc8b5e
UHC ?咫?哭?咫?慊±?咫?哭?咫?慊+^ 0011111111110010101000010011111111001101110101100011111111110010101000010011111111001100110000111010000110111110001111111111001010100001001111111100110111010110001111111111001010100001001111111100110011000011101000111010101101011110 3ff2a13fcdd63ff2a13fccc3a1be3ff2a13fcdd63ff2a13fccc3a3ab5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
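
A conversion like the one in the table above can be approximated in a few lines of Python. This is only a sketch, not the converter's actual implementation. The names `CODECS`, `encode_row`, and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names is an assumption (SJIS-WIN to `cp932` follows the note above; UHC to `cp949` is a guess based on common usage). Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary, hex) strings."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Build one (binary, hex) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("±^").items():
        print(label, bits, hexa)
```

For example, `encode_row("±^", "iso-8859-1")` yields the hex string `b15e`, the same bytes that the ISO-8859-1 row shows for `±` and `^`.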