To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??泣?????馭??筍??飮??筌??B 11100010101000110011111100111111100010111000001100111111001111110011111100111111001111111110100101100110001111110011111111100010101000010011111100111111100111110101101000111111001111111110001010100011001111110011111101000010 e2a33f3f8b833f3f3f3f3fe9663f3fe2a13f3f9f5a3f3fe2a33f3f42
EUC-JP 筌??泣?????馭??筍??飮??筌??B 11100100101001010011111100111111101101011110001100111111001111110011111100111111001111111111000111000111001111110011111111100100101000110011111100111111110111011011101100111111001111111110010010100101001111110011111101000010 e4a53f3fb5e33f3f3f3f3ff1c73f3fe4a33f3fddbb3f3fe4a53f3f42
UTF-8 筌욌벤泣욑쭪類ㅺ봄馭곥룂筍싷㏊飮덉췅筌욎릍B 11100111101011011000110011101100100110101000110011101011101100101010010011100110101100111010001111101100100110101001000111101100101011011010101011101111101001111001000011100011100001011011101011101011101101001000010011101001101001101010110111101010101100111010010111101011101000111000001011100111101011011000110111101100100010111011011111100011100011111000101011101001101000111010111011101011100011011000100111101100101101111000010111100111101011011000110011101100100110101000111011101011101001101000110101000010 e7ad8cec9a8cebb2a4e6b3a3ec9a91ecadaaefa790e385baebb484e9a6adeab3a5eba382e7ad8dec8bb7e38f8ae9a3aeeb8d89ecb785e7ad8cec9a8eeba68d42
UHC 筌욌벤泣욑쭪類ㅺ봄馭곥룂筍싷㏊飮덉췅筌욎릍B 11101111101001111001111011101011101110101010010111101011111010001001111011101111101001111001111011101011101110101010010011101010101110101011110111100101110111111000000111100011100011111000001111100010111011001001101011101111101001111011010111101011111001101000100011101100101011011010000011101111101001111001111011101100101110001010110001000010 efa79eebbaa5ebe89eefa79eebbaa4eababde5df81e38f83e2ec9aefa7b5ebe688ecada0efa79eecb8ac42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)