To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣х??⑥?潁??揖ф?擬??癲??柚 00111111001111110011111110001011100000111000010010000111001111110011111110000111010001010011111110011111111100010011111100111111100101110100101110000100100001100011111110001011010110110011111100111111111000011001111100111111001111111001011101001101 3f3f3f8b8384873f3f87453f9ff13f3f974b84863f8b5b3f3fe19f3f3f974d
EUC-JP ???泣х?洹??潁??揖ф?擬??癲??柚 0011111100111111001111111011010111100011101001111110011100111111100011111100011110111010001111110011111111011110111100110011111100111111110011011010110010100111111001100011111110110101101111000011111100111111111000101010000100111111001111111100110110101110 3f3f3fb5e3a7e73f8fc7ba3f3fdef33f3fcdaca7e63fb5bc3f3fe2a13f3fcdae
UTF-8 黎싳뼲泣х툞洹⑥돩潁뺢랬揖ф콢擬듭뒳癲얘엥柚 11101111101001101000100111101100100010111011001111101011101111001011001011100110101100111010001111010001100001011110110110001000100111101110011010110100101110011110001010010001101001011110101110001111101010011110011010111101100000011110101110111010101000101110101110011110101011001110011010001111100101101101000110000100111011001011110110100010111001101001001110101100111010111001001110101101111010111001001010110011111001111001100110110010111011001001011010011000111011001001011110100101111001101001111110011010 efa689ec8bb3ebbcb2e6b3a3d185ed889ee6b4b9e291a5eb8fa9e6bd81ebbaa2eb9eace68f96d184ecbda2e693aceb93adeb92b3e799b2ec9698ec97a5e69f9a
UHC 黎싳뼲泣х툞洹⑥돩潁뺢랬揖ф콢擬듭뒳癲얘엥柚 1110011010110001100110101110110010010110101101011110101111101000101011001110011110111000100101011110101010110111101010001110110010001001101011001110011110111000100101011110101010110111101010001110101111100111101011001110011010110001100110101110101111110100101101011110110010001010101011001110111110100110101111101110101010111111101010001110101011110110 e6b19aec96b5ebe8ace7b895eab7a8ec89ace7b895eab7a8ebe7ace6b19aebf4b5ec8aacefa6beeabfa8eaf6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)