To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ティテ淞姪ヲツ仰骸ティテ淞姪ヲツ仰骸^ 110000111010100011000011100111111100001010010110110000111010011011000010100010111100001010001010010110111100001110101000110000111001111111000010100101101100001110100110110000101000101111000010100010100101101101011110 c3a8c39fc296c3a6c28bc28a5bc3a8c39fc296c3a6c28bc28a5b5e
EUC-JP ティテ淞姪ヲツ仰骸ティテ淞姪ヲツ仰骸^ 10001110110000111000111010101000100011101100001111011110110001001100110011000101100011101010011010001110110000101011011011000100101100111011110010001110110000111000111010101000100011101100001111011110110001001100110011000101100011101010011010001110110000101011011011000100101100111011110001011110 8ec38ea88ec3dec4ccc58ea68ec2b6c4b3bc8ec38ea88ec3dec4ccc58ea68ec2b6c4b3bc5e
UTF-8 ティテ淞姪ヲツ仰骸ティテ淞姪ヲツ仰骸^ 11101111101111101000001111101111101111011010100011101111101111101000001111100110101101111001111011100101101001111010101011101111101111011010011011101111101111101000001011100100101110111011000011101001101010101011100011101111101111101000001111101111101111011010100011101111101111101000001111100110101101111001111011100101101001111010101011101111101111011010011011101111101111101000001011100100101110111011000011101001101010101011100001011110 efbe83efbda8efbe83e6b79ee5a7aaefbda6efbe82e4bbb0e9aab8efbe83efbda8efbe83e6b79ee5a7aaefbda6efbe82e4bbb0e9aab85e
UHC ???淞姪??仰骸???淞姪??仰骸^ 001111110011111100111111111000011110011111110010111010110011111100111111111001001110011011111010101101010011111100111111001111111110000111100111111100101110101100111111001111111110010011100110111110101011010101011110 3f3f3fe1e7f2eb3f3fe4e6fab53f3f3fe1e7f2eb3f3fe4e6fab55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)