To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 狎??妖??獰??[狎??妖??獰??[^ 111000001011111000111111001111111001011101100100001111110011111111100000110101100011111100111111010110111110000010111110001111110011111110010111011001000011111100111111111000001101011000111111001111110101101101011110 e0be3f3f97643f3fe0d63f3f5be0be3f3f97643f3fe0d63f3f5b5e
EUC-JP 狎??妖??獰??[狎??妖??獰??[^ 111000001100000000111111001111111100110111000101001111110011111111100000110110000011111100111111010110111110000011000000001111110011111111001101110001010011111100111111111000001101100000111111001111110101101101011110 e0c03f3fcdc53f3fe0d83f3f5be0c03f3fcdc53f3fe0d83f3f5b5e
UTF-8 狎볡푻妖덂넃獰뉔룉[狎볡푻妖덂넃獰뉔룉[^ 111001111000101110001110111010111011001110100001111011011001000110111011111001011010011010010110111010111000110110000010111010111000010010000011111001111000110110110000111010111000100110010100111010111010001110001001010110111110011110001011100011101110101110110011101000011110110110010001101110111110010110100110100101101110101110001101100000101110101110000100100000111110011110001101101100001110101110001001100101001110101110100011100010010101101101011110 e78b8eebb3a1ed91bbe5a696eb8d82eb8483e78db0eb8994eba3895be78b8eebb3a1ed91bbe5a696eb8d82eb8483e78db0eb8994eba3895b5e
UHC 狎볡푻妖덂넃獰뉔룉[狎볡푻妖덂넃獰뉔룉[^ 111001001110010010010011111001111011111010000111111010001110110110001000111001011000011010010011111001111011111010000111111010011000111110001000010110111110010011100100100100111110011110111110100001111110100011101101100010001110010110000110100100111110011110111110100001111110100110001111100010000101101101011110 e4e493e7be87e8ed88e58693e7be87e98f885be4e493e7be87e8ed88e58693e7be87e98f885b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)