To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
UTF-8 렻┙┖ㅔ렎렻┙ㅕㅔ렧렻┙ㅕㅔ늚┙┖ㅔ렚렻┙ㅕㅔ슝B 11101011101000001011101111100010100101001001100111100010100101001001011011100011100001011001010011101011101000001000111011101011101000001011101111100010100101001001100111100011100001011001010111100011100001011001010011101011101000001010011111101011101000001011101111100010100101001001100111100011100001011001010111100011100001011001010011101011100010101001101011100010100101001001100111100010100101001001011011100011100001011001010011101011101000001001101011101011101000001011101111100010100101001001100111100011100001011001010111100011100001011001010011101100100010101001110101000010 eba0bbe29499e29496e38594eba08eeba0bbe29499e38595e38594eba0a7eba0bbe29499e38595e38594eb8a9ae29499e29496e38594eba09aeba0bbe29499e38595e38594ec8a9d42
UHC 렻┙┖ㅔ렎렻┙ㅕㅔ렧렻┙ㅕㅔ늚┙┖ㅔ렚렻┙ㅕㅔ슝B 10001110110000111010011011000100101001101100010110100100110001001000111010100100100011101100001110100110110001001010010011000101101001001100010010001110101101101000111011000011101001101100010010100100110001011010010011000100101101001100010110100110110001001010011011000101101001001100010010001110101011011000111011000011101001101100010010100100110001011010010011000100101111011011100101000010 8ec3a6c4a6c5a4c48ea48ec3a6c4a4c5a4c48eb68ec3a6c4a4c5a4c4b4c5a6c4a6c5a4c48ead8ec3a6c4a4c5a4c4bdb942

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
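
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names is an assumption (in particular `cp949` for UHC), and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to match the ISO-8859-1, SJIS-WIN, and EUC-JP rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (character as stored, binary bit string, hex bit string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return raw.decode(codec), binary, raw.hex()


def encode_table(text):
    """Build one row per charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, binary, hexa) in encode_table("あB").items():
        print(label, shown, binary, hexa)
```

Each row holds the same three values as the table columns: the character as it survives the encoding, its bits, and its bytes in hexadecimal.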