To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???伊??音??癰??認??魏??筌μ?? 0011111100111111001111111000100011001001001111110011111110001001101110010011111100111111111000011001111000111111001111111001010001000110001111110011111111101001101100000011111100111111111000101010001110000011110010100011111100111111 3f3f3f88c93f3f89b93f3fe19e3f3f94463f3fe9b03f3fe2a383ca3f3f
EUC-JP ???伊??音??癰??認??魏??筌μ?靷 00111111001111110011111110110000110010110011111100111111101100101011101100111111001111111110000111111110001111110011111111000111101001110011111100111111111100101011001000111111001111111110010010100101101001101100110000111111100011111110011110111101 3f3f3fb0cb3f3fb2bb3f3fe1fe3f3fc7a73f3ff2b23f3fe4a5a6cc3f8fe7bd
UTF-8 嶪용뜆伊쒏룚音쀫껜癰귥쥒認됮땡魏낆벁筌μ떓靷 1110010110110110101010101110110010011010101010011110101110011100100001101110010010111100100010101110110010010010100011111110101110100011100110101110100110011111101100111110110010000000101010111110101010111011100111001110011110011001101100001110101010110111101001011110110010100101100100101110100010101010100011011110101110010000101011101110101110010101101000011110100110101101100011111110101110000010100001101110101110110010100000011110011110101101100011001100111010111100111010111001011010010011111010011001110110110111 e5b6aaec9aa9eb9c86e4bc8aec928feba39ae99fb3ec80abeabb9ce799b0eab7a5eca592e8aa8deb90aeeb95a1e9ad8feb8286ebb281e7ad8ccebceb9693e99db7
UHC 嶪용뜆伊쒏룚音쀫껜癰귥쥒認됮땡魏낆벁筌μ떓靷 1110010111110101101111111110101110001101100010011110110010100101100111001110011010001111100101101110101111100101100101111110101110110010101101001110100010111001100000101110110010100010100010011110110011100011100010011110100110110110101011111110101011100000100001011110110010010011101001111110111110100111101001011110110010001011101010011110110011100110 e5f5bfeb8d89eca59ce68f96ebe597ebb2b4e8b982eca289ece389e9b6afeae085ec93a7efa7a5ec8ba9ece6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)