To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??裕??儒??厓??誼↓?擬??ぜ 100110001101101000111111001111111001011101010100001111110011111110001110111100100011111100111111111110101000110100111111001111111000101101100010100000011010101100111111100010110101101100111111001111111000001010111010 98da3f3f97543f3f8ef23f3ffa8d3f3f8b6281ab3f8b5b3f3f82ba
EUC-JP 俑??裕??儒??厓??誼↓?擬??ぜ 11010000110111000011111100111111110011011011010100111111001111111011110011110100001111110011111110001111101101001100011100111111001111111011010111000011101000101010110100111111101101011011110000111111001111111010010010111100 d0dc3f3fcdb53f3fbcf43f3f8fb4c73f3fb5c3a2ad3fb5bc3f3fa4bc
UTF-8 俑앹늿裕뉓짆儒삠걶厓쀬눖誼↓뒽擬쑩딂ぜ 111001001011111110010001111011001001010110111001111010111000101010111111111010001010001110010101111010111000100110010011111011001010011110000110111001011000010010010010111011001000001010100000111010101011000110110110111001011000111010010011111011001000000010101100111010111000100010010110111010001010101010111100111000101000011010010011111010111001001010111101111001101001001110101100111011001001000110101001111010111001010010000010111000111000000110011100 e4bf91ec95b9eb8abfe8a395eb8993eca786e58492ec82a0eab1b6e58e93ec80aceb8896e8aabce28693eb92bde693acec91a9eb9482e3819c
UHC 俑앹늿裕뉓짆儒삠걶厓쀬눖誼↓뒽擬쑩딂ぜ 1110100110110101100111011110110010001000100010001110101110101110100001111110100010100011100101011110101011100011101110111110001110000001100111001110010011101101100101111110110010000111101100001110101111111110101000011110100110001010101100111110101111110100100111001100010110001010111010001010101010111100 e9b59dec8888ebae87e8a395eae3bbe3819ce4ed97ec87b0ebfea1e98ab3ebf49cc58ae8aabc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)