To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???倚ф?喩??嚥▲????純??吾??異 001111110011111100111111100110001101111110000100100001100011111110011010011001110011111100111111100110101000101110000001101000110011111100111111001111110011111110001111100000110011111100111111100011001110000100111111001111111000100011011001 3f3f3f98df84863f9a673f3f9a8b81a33f3f3f3f8f833f3f8ce13f3f88d9
EUC-JP ???倚ф?喩??嚥▲????純??吾??異 001111110011111100111111110100001110000110100111111001100011111111010011110010000011111100111111110100111110101110100010101001010011111100111111001111110011111110111101111000110011111100111111101110001110001100111111001111111011000011011011 3f3f3fd0e1a7e63fd3c83f3fd3eba2a53f3f3f3fbde33f3fb8e33f3fb0db
UTF-8 捻뀁빓倚ф츎喩쎼렃嚥▲렞溜띶윜純볥샑吾몄뜾異 1110111110100110101001001110101110000000100000011110101110111001100100111110010110000000100110101101000110000100111011001011100010001110111001011001011010101001111011001000111010111100111010111010000010000011111001011001101010100101111000101001011010110010111010111010000010011110111011111010011110001011111010111001110110110110111011001001110010011100111001111011010010010100111010111011001110100101111011001000001110010001111001011001000010111110111010111010101010000100111010111001110010111110111001111001010110110000 efa6a4eb8081ebb993e5809ad184ecb88ee596a9ec8ebceba083e59aa5e296b2eba09eefa78beb9db6ec9c9ce7b494ebb3a5ec8391e590beebaa84eb9cbee795b0
UHC 捻뀁빓倚ф츎喩쎼렃嚥▲렞溜띶윜純볥샑吾몄뜾異 1110011011110111101100101110110010010101101101111110101111101111101011001110011010101110100010011110101011100111100110111110001110001110100111011110011010111111101000011110001110001110101011111110101011111110100011011110010110011111100111111110001011101101100100111110101110011000101111101110011111101110101110001110110010001101101110011110110010110110 e6f7b2ec95b7ebeface6ae89eae79be38e9de6bfa1e38eafeafe8de59f9fe2ed93eb98bee7eeb8ec8db9ecb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)