To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??艤??節??儼??泣??乙λ?腋 111000111010000000111111001111111110010001111110001111110011111110010000110111110011111100111111100110010101011000111111001111111000101110000011001111110011111110001001101100111000001111001001001111111110001111111100 e3a03f3fe47e3f3f90df3f3f99563f3f8b833f3f89b383c93fe3fc
EUC-JP 罌??艤??節??儼??泣??乙λ?腋 111001101010001000111111001111111110011111011111001111110011111111000000111000010011111100111111110100011011011100111111001111111011010111100011001111110011111110110010101101011010011011001011001111111110011011111110 e6a23f3fe7df3f3fc0e13f3fd1b73f3fb5e33f3fb2b5a6cb3fe6fe
UTF-8 罌산막艤욕♤節륂뭵儼볥톪泣섆벧乙λ븴腋 1110011110111101100011001110110010000010101100001110101110100111100010011110100010001001101001001110110010011010100101011110001010011001101001001110011110101111100000001110101110100101100000101110101110101101101101011110010110000100101111001110101110110011101001011110110110000110101010101110011010110011101000111110110010000100100001101110101110110010101001111110010010111001100110011100111010111011111010111011100010110100111010001000010110001011 e7bd8cec82b0eba789e889a4ec9a95e299a4e7af80eba582ebadb5e584bcebb3a5ed86aae6b3a3ec8486ebb2a7e4b999cebbebb8b4e8858b
UHC 罌산막艤욕♤節륂뭵儼볥톪泣섆벧乙λ븴腋 1110010110100010101110111110101010111000101101111110101111111010101111111110010110100010101110111110111110111101100011111110110110010010100001001110010111110000100100111110101110110111100000101110101111101000100110001110010010111010101001101110101111100000101001011110101110010101100111011110010011111101 e5a2bbeab8b7ebfabfe5a2bbefbd8fed9284e5f093ebb782ebe898e4baa6ebe0a5eb959de4fd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)