To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???埃??吟??}???埃??吟??{^ 00111111001111110011111110011010101110100011111100111111100010111110000100111111001111110111110100111111001111110011111110011010101110100011111100111111100010111110000100111111001111110111101101011110 3f3f3f9aba3f3f8be13f3f7d3f3f3f9aba3f3f8be13f3f7b5e
EUC-JP ???埃??吟??}???埃??吟??{^ 00111111001111110011111111010100101111000011111100111111101101101110001100111111001111110111110100111111001111110011111111010100101111000011111100111111101101101110001100111111001111110111101101011110 3f3f3fd4bc3f3fb6e33f3f7d3f3f3fd4bc3f3fb6e33f3f7b5e
UTF-8 琉뗥깋埃덊맔吟끾뀲}琉뗥깋埃덊맔吟끾뀲{^ 111011111010011110001100111010111001011110100101111010101011100110001011111001011001111110000011111010111000110110001010111010111010011110010100111001011001000010011111111010111000000110111110111010111000000010110010011111011110111110100111100011001110101110010111101001011110101010111001100010111110010110011111100000111110101110001101100010101110101110100111100101001110010110010000100111111110101110000001101111101110101110000000101100100111101101011110 efa78ceb97a5eab98be59f83eb8d8aeba794e5909feb81beeb80b27defa78ceb97a5eab98be59f83eb8d8aeba794e5909feb81beeb80b27b5e
UHC 琉뗥깋埃덊맔吟끾뀲}琉뗥깋埃덊맔吟끾뀲{^ 111010111010010010001011111001011000001110001001111001001110111110001000111011011001000010100110111010111110000110000101111001101000010110101000011111011110101110100100100010111110010110000011100010011110010011101111100010001110110110010000101001101110101111100001100001011110011010000101101010000111101101011110 eba48be58389e4ef88ed90a6ebe185e685a87deba48be58389e4ef88ed90a6ebe185e685a87b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)