To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????h??????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100000111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f
SJIS-WIN ???已?????譽?ジ吟??h???已??? 00111111001111110011111110011011110111110011111100111111001111110011111100111111111001101010001100111111100000110101011110001011111000010011111100111111011010000011111100111111001111111001101111011111001111110011111100111111 3f3f3f9bdf3f3f3f3f3fe6a33f83578be13f3f683f3f3f9bdf3f3f3f
EUC-JP ???已?????譽?ジ吟??h???已??? 00111111001111110011111111010110111000010011111100111111001111110011111100111111111011001010010100111111101001011011100010110110111000110011111100111111011010000011111100111111001111111101011011100001001111110011111100111111 3f3f3fd6e13f3f3f3f3feca53fa5b8b6e33f3f683f3f3fd6e13f3f3f
UTF-8 琉뗨꽮已몄깦溜묐쨱譽뱀ジ吟섎젒h琉뗨꽮已몄깦溜 11101111101001111000110011101011100101111010100011101010101111011010111011100101101101111011001011101011101010101000010011101010101110011010011011101111101001111000101111101011101011001001000011101100101010001011000111101000101011011011110111101011101100011000000011100011100000101011100011100101100100001001111111101100100001001000111011101100101000001001001001101000111011111010011110001100111010111001011110101000111010101011110110101110111001011011011110110010111010111010101010000100111010101011100110100110111011111010011110001011 efa78ceb97a8eabdaee5b7b2ebaa84eab9a6efa78bebac90eca8b1e8adbdebb180e382b8e5909fec848eeca09268efa78ceb97a8eabdaee5b7b2ebaa84eab9a6efa78b
UHC 琉뗨꽮已몄깦溜묐쨱譽뱀ジ吟섎젒h琉뗨꽮已몄깦溜 111010111010010010001011111010001000010010111001111011001010101110111000111011001000001110011000111010101111111010010001111010111010010010001011111001111110001010111001111011001010101110111000111010111110000110011000111010111010000010010001011010001110101110100100100010111110100010000100101110011110110010101011101110001110110010000011100110001110101011111110 eba48be884b9ecabb8ec8398eafe91eba48be7e2b9ecabb8ebe198eba09168eba48be884b9ecabb8ec8398eafe
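
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the names CODECS, encode_row and encode_table are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS, keyed by the table label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("h").items():
        print(label, bits, hexa)
```

For the letter "h", every charset in the table yields the same single byte, 68 (binary 01101000), because all five encodings are ASCII-compatible for that character.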

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).