To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 怏??熱??繞??有??飮???i?飮??^ 1001110010001001001111110011111110010100010011010011111100111111111000111000010100111111001111111001011101001100001111110011111110011111010110100011111100111111001111111000001010001001001111111001111101011010001111110011111101011110 9c893f3f944d3f3fe3853f3f974c3f3f9f5a3f3f3f82893f9f5a3f3f5e
EUC-JP 怏??熱??繞??有??飮???i?飮??^ 1101011111101001001111110011111111000111101011100011111100111111111001011110010100111111001111111100110110101101001111110011111111011101101110110011111100111111001111111010001111101001001111111101110110111011001111110011111101011110 d7e93f3fc7ae3f3fe5e53f3fcdad3f3fddbb3f3f3fa3e93fddbb3f3f5e
UTF-8 怏얠늾熱듬베繞섏닠有뗥듋飮뉒뮫類i쨾飮좊븢^ 11100110100000001000111111101100100101101010000011101011100010101011111011100111100001101011000111101011100100111010110011101011101100101010000011100111101110011001111011101100100001001000111111101011100010111010000011100110100111001000100111101011100101111010010111101011100100111000101111101001101000111010111011101011100010011001001011101011101011101010101111101111101001111001000011101111101111011000100111101100101010001011111011101001101000111010111011101100101000101000101011101011101110001010001001011110 e6808fec96a0eb8abee786b1eb93acebb2a0e7b99eec848feb8ba0e69c89eb97a5eb938be9a3aeeb8992ebaeabefa790efbd89eca8bee9a3aeeca28aebb8a25e
UHC 怏얠늾熱듬베繞섏닠有뗥듋飮뉒뮫類i쨾飮좊븢^ 11100100111010001011111011101100100010001000011111100110111100001011010111101011101110101010001111101001101001001001100011101100100010001010000011101010111100111000101111100101100010101011111011101011111001101000011111100111100100101011010111101011101110101010001111101001101001001001100011101011111001101010000011101011100101011000101101011110 e4e8beec8887e6f0b5ebbaa3e9a498ec88a0eaf38be58abeebe687e792b5ebbaa3e9a498ebe6a0eb958b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)