To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???鎰???h?夜??榮??荏??預??^ 00111111001111110011111111101000010011000011111100111111001111111000001010001000001111111001011011101001001111110011111110011110110001000011111100111111100010010110000000111111001111111001011101100001001111110011111101011110 3f3f3fe84c3f3f3f82883f96e93f3f9ec43f3f89603f3f97613f3f5e
EUC-JP ???鎰???h?夜??榮??荏??預??^ 00111111001111110011111111101111101011010011111100111111001111111010001111101000001111111100110011101011001111110011111111011100110001100011111100111111101100011100000100111111001111111100110111000010001111110011111101011110 3f3f3fefad3f3f3fa3e83fcceb3f3fdcc63f3fb1c13f3fcdc23f3f5e
UTF-8 琉쀧큹鎰뉙쓲溜h뒡夜쇰쨽榮덊뭸荏먮젫預앭돲^ 11101111101001111000110011101100100000001010011111101101100000011011100111101001100011101011000011101011100010011001100111101100100100111011001011101111101001111000101111101111101111011000100011101011100100101010000111100101101001001001110011101100100001111011000011101100101010001011110111100110101001101010111011101011100011011000101011101011101011011011100011101000100011011000111111101011101010001010111011101100101000001010101111101001101000001001000011101100100101011010110111101011100011111011001001011110 efa78cec80a7ed81b9e98eb0eb8999ec93b2efa78befbd88eb92a1e5a49cec87b0eca8bde6a6aeeb8d8aebadb8e88d8feba8aeeca0abe9a090ec95adeb8fb25e
UHC 琉쀧큹鎰뉙쓲溜h뒡夜쇰쨽榮덊뭸荏먮젫預앭돲^ 11101011101001001001011111100111101101001000100011101100111100001000011111101101100111011001000011101010111111101010001111101000100010101001110111100101101010001011110011101011101001001001011111100111101101001000100011101101100100101000011111101100111110111001000011101011101000001010001111100111111010001001110111100101100010011011010101011110 eba497e7b488ecf087ed9d90eafea3e88a9de5a8bceba497e7b488ed9287ecfb90eba0a3e7e89de589b55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)