To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 諺??椅??喩??疫???汚???わⅦ儒?ⅨB 1000110010111111001111110011111110001000110101100011111100111111100110100110011100111111001111111000100101110101001111110011111100111111100010011001100000111111001111110011111110000010111011011000011101011010100011101111001000111111100001110101110001000010 8cbf3f3f88d63f3f9a673f3f89753f3f3f89983f3f3f82ed875a8ef23f875c42
EUC-JP 諺??椅??喩??疫???汚??瑗わ?儒??B 1011100011000001001111110011111110110000110110000011111100111111110100111100100000111111001111111011000111010110001111110011111100111111101100011111100000111111001111111000111111001100110000001010010011101111001111111011110011110100001111110011111101000010 b8c13f3fb0d83f3fd3c83f3fb1d63f3f3fb1f83f3f8fccc0a4ef3fbcf43f3f42
UTF-8 諺⑸쉼椅뷰풌喩먭퍢疫꽝띕릎汚삳슢瑗わⅦ儒좊ⅨB 11101000101010111011101011100010100100011011100011101100100010011011110011100110101001001000010111101011101101111011000011101101100100101000110011100101100101101010100111101011101010001010110111101101100011011010001011100111100101101010101111101010101111011001110111101011100111011001010111101011101001101000111011100110101100011001101011101100100000101011001111101100100010101010001011100111100100011001011111100011100000101000111111100010100001011010011011100101100001001001001011101100101000101000101011100010100001011010100001000010 e8abbae291b8ec89bce6a485ebb7b0ed928ce596a9eba8aded8da2e796abeabd9deb9d95eba68ee6b19aec82b3ec8aa2e79197e3828fe285a6e58492eca28ae285a842
UHC 諺⑸쉼椅뷰풌喩먭퍢疫꽝띕릎汚삳슢瑗わⅦ儒좊ⅨB 111001011110110010101001111010111011110110110000111010111111010110111010111001001011111010010001111010101110011110010000111010101011101110011001111001101011100110110010110011101011011011101011101110001010110111100111111111011011101111101011100110101010111011101010101111001010101011101111101001011011011011101010111000111010000011101011101001011011100001000010 e5eca9ebbdb0ebf5bae4be91eae790eabb99e6b9b2ceb6ebb8ade7fdbbeb9aaeeabcaaefa5b6eae3a0eba5b842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)