To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 曄?????癰??}曄?????癰??{^ 10011110010000000011111100111111001111110011111100111111111000011001111000111111001111110111110110011110010000000011111100111111001111110011111100111111111000011001111000111111001111110111101101011110 9e403f3f3f3f3fe19e3f3f7d9e403f3f3f3f3fe19e3f3f7b5e
EUC-JP 曄?????癰??}曄?????癰??{^ 11011011101000010011111100111111001111110011111100111111111000011111111000111111001111110111110111011011101000010011111100111111001111110011111100111111111000011111111000111111001111110111101101011110 dba13f3f3f3f3fe1fe3f3f7ddba13f3f3f3f3fe1fe3f3f7b5e
UTF-8 曄됯퀗溜깅졁癰잜꽩}曄됯퀗溜깅졁癰잜꽩{^ 111001101001101110000100111010111001000010101111111011011000000010010111111011111010011110001011111010101011100110000101111011001010000110000001111001111001100110110000111011001001111010011100111010101011110110101001011111011110011010011011100001001110101110010000101011111110110110000000100101111110111110100111100010111110101010111001100001011110110010100001100000011110011110011001101100001110110010011110100111001110101010111101101010010111101101011110 e69b84eb90afed8097efa78beab985eca181e799b0ec9e9ceabda97de69b84eb90afed8097efa78beab985eca181e799b0ec9e9ceabda97b5e
UHC 曄됯퀗溜깅졁癰잜꽩}曄됯퀗溜깅졁癰잜꽩{^ 111001111010010110001001111010101011001110001100111010101111111010110001111010111010000010110010111010001011100110011111111011011000010010110100011111011110011110100101100010011110101010110011100011001110101011111110101100011110101110100000101100101110100010111001100111111110110110000100101101000111101101011110 e7a589eab38ceafeb1eba0b2e8b99fed84b47de7a589eab38ceafeb1eba0b2e8b99fed84b47b5e
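The kind of conversion shown in the table can be sketched in Python. This is a minimal illustration, not the tool's actual implementation: the function name `encode_table` and the `CHARSETS` list are hypothetical, the codec names are Python's own, and mapping UHC to Python's `cp949` codec is an assumption. Replacing unencodable characters with `?` (0x3f) is inferred from the runs of `3f` in the table rows.

```python
# Hypothetical sketch: encode a string in several charsets and show the
# resulting bytes as a binary bit string and a hexadecimal string.

# (label shown in the table, Python codec name). The UHC -> cp949 mapping
# is an assumption; SJIS-WIN -> cp932 follows the note on this page.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return a list of (label, bit_string, hex_string) rows for text."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" (0x3f) for characters the
        # charset cannot represent.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("曄{^"):
        print(label, bits, hex_string)
```

Calling `encode_table` with a short string returns one row per charset, so the binary and hexadecimal columns can be compared side by side, as in the table.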

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).