To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN ?功∮??た?訥?U}?功∮??た?訥?U{^ 00111111100011001111011110000111100100110011111100111111100000101011110100111111111001100110001100111111010101010111110100111111100011001111011110000111100100110011111100111111100000101011110100111111111001100110001100111111010101010111101101011110 3f8cf787933f3f82bd3fe6633f557d3f8cf787933f3f82bd3fe6633f557b5e
EUC-JP ?功???た?訥?U}?功???た?訥?U{^ 0011111110111000111110010011111100111111001111111010010010111111001111111110101111000100001111110101010101111101001111111011100011111001001111110011111100111111101001001011111100111111111010111100010000111111010101010111101101011110 3fb8f93f3f3fa4bf3febc43f557d3fb8f93f3f3fa4bf3febc43f557b5e
UTF-8 룴功∮룫혧た룶訥▣U}룴功∮룫혧た룶訥▣U{^ 1110101110100011101101001110010110001010100111111110001010001000101011101110101110100011101010111110110110011000101001111110001110000001100111111110101110100011101101101110100010101000101001011110001010010110101000110101010101111101111010111010001110110100111001011000101010011111111000101000100010101110111010111010001110101011111011011001100010100111111000111000000110011111111010111010001110110110111010001010100010100101111000101001011010100011010101010111101101011110 eba3b4e58a9fe288aeeba3abed98a7e3819feba3b6e8a8a5e296a3557deba3b4e58a9fe288aeeba3abed98a7e3819feba3b6e8a8a5e296a3557b5e
UHC 룴功∮룫혧た룶訥▣U}룴功∮룫혧た룶訥▣U{^ 1000111110101001110011011110110110100010101100011000111110100010110000101000111110101010101111111000111110101011110100101110110110100010110000110101010101111101100011111010100111001101111011011010001010110001100011111010001011000010100011111010101010111111100011111010101111010010111011011010001011000011010101010111101101011110 8fa9cdeda2b18fa2c28faabf8fabd2eda2c3557d8fa9cdeda2b18fa2c28faabf8fabd2eda2c3557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)