To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雍?????韋??癰????Ⅶ儒??阿??蚓 1110100010110100001111110011111100111111001111110011111111101000111010000011111100111111111000011001111000111111001111110011111100111111100001110101101010001110111100100011111100111111100010001010001000111111001111111110010101101101 e8b43f3f3f3f3fe8e83f3fe19e3f3f3f3f875a8ef23f3f88a23f3fe56d
EUC-JP 雍?????韋??癰?????儒??阿??蚓 11110000101101100011111100111111001111110011111100111111111100001110101000111111001111111110000111111110001111110011111100111111001111110011111110111100111101000011111100111111101100001010010000111111001111111110100111001110 f0b63f3f3f3f3ff0ea3f3fe1fe3f3f3f3f3fbcf43f3fb0a43f3fe9ce
UTF-8 雍우궡劉닷퐲韋살툙癰귥쥓吏껓Ⅶ儒몄구阿숆남蚓 111010011001101110001101111011001001101010110000111010101011011010100001111011111010011110000111111010111000101110110111111011011001000010110010111010011001111110001011111011001000001010110100111011011000100010011001111001111001100110110000111010101011011110100101111011001010010110010011111011111010011110011110111010101011101110010011111000101000010110100110111001011000010010010010111010111010101010000100111010101011010110101100111010011001100010111111111011001000100010000110111010111000001010101000111010001001101010010011 e99b8dec9ab0eab6a1efa787eb8bb7ed90b2e99f8bec82b4ed8899e799b0eab7a5eca593efa79eeabb93e285a6e58492ebaa84eab5ace998bfec8886eb82a8e89a93
UHC 雍우궡劉닷퐲韋살툙癰귥쥓吏껓Ⅶ儒몄구阿숆남蚓 1110100010111100101111111110110010000010101101001110101011100101101101001110010110111101100110111110101011011111101110111110110010111000100100001110100010111001100000101110110010100010100010101110110010100111100000111110111110100101101101101110101011100011101110001110110010110001101110001110010010111001100110011110101010110011101100101110110011100010 e8bcbfec82b4eae5b4e5bd9beadfbbecb890e8b982eca28aeca783efa5b6eae3b8ecb1b8e4b999eab3b2ece2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)