To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 橈??油??矣??艶l?肄??宋????【? 100111101111010000111111001111111001011011111011001111110011111111100001111000010011111100111111100010011001000010000010100011000011111111100011111001010011111100111111100100010111011000111111001111110011111100111111100000010111100100111111 9ef43f3f96fb3f3fe1e13f3f8990828c3fe3e53f3f91763f3f3f3f81793f
EUC-JP 橈??油??矣??艶l?肄??宋??孼?【? 1101110011110110001111110011111111001100111111010011111100111111111000101110001100111111001111111011000111110000101000111110110000111111111001101110011100111111001111111100000111010111001111110011111110001111101110101100001100111111101000011101101000111111 dcf63f3fccfd3f3fe2e33f3fb1f0a3ec3fe6e73f3fc1d73f3f8fbac33fa1da3f
UTF-8 橈볥굝油꾦걬矣멥렆艶l꼵肄잌떳宋믨슈孼꾩【溜 111001101010100110001000111010111011001110100101111010101011010110011101111001101011001010111001111010101011111010100110111010101011000110101100111001111001111110100011111010111010100110100101111010111010000010000110111010001000100110110110111011111011110110001100111010101011110010110101111010001000001010000100111011001001111010001100111010111001011010110011111001011010111010001011111010111010111110101000111011001000101010001000111001011010110110111100111010101011111010101001111000111000000010010000111011111010011110001011 e6a988ebb3a5eab59de6b2b9eabea6eab1ace79fa3eba9a5eba086e889b6efbd8ceabcb5e88284ec9e8ceb96b3e5ae8bebafa8ec8a88e5adbceabea9e38090efa78b
UHC 橈볥굝油꾦걬矣멥렆艶l꼵肄잌떳宋믨슈孼꾩【溜 1110100011111010100100111110101110000010100001011110101011111010100001001110100110000001100101011110101111111000101110001110001110001110101000001110011011111101101000111110110010000100100011011110110010111101100111111110010110110110101110001110000111100100100100101110101010111101101101001110010111101101100001001110110010100001101111001110101011111110 e8fa93eb8285eafa84e98195ebf8b8e38ea0e6fda3ec848decbd9fe5b6b8e1e492eabdb4e5ed84eca1bceafe

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
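
The conversion shown in the table can be approximated offline with a short script. The following is a minimal sketch, not the code behind this page: the mapping from the charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption, and the helper name encode_rows is made up for illustration. Characters that a charset cannot represent are replaced with "?" (3f), which is presumably why the ISO-8859-1 row consists entirely of 3f bytes.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_rows(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(format(byte, "08b") for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("橈"):
        print(label, bits, hex_string)
```

Results for individual characters may differ slightly from the table, since codec implementations do not always agree on vendor-specific extensions.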