How is a character (or string) encoded as a bit string in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????oBF 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011011110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6f4246
SJIS-WIN 鸚??逸??音?????飮??伎逸?oBF 111010100101111100111111001111111000100011101101001111110011111110001001101110010011111100111111001111110011111100111111100111110101101000111111001111111000101011101010100010001110110100111111011011110100001001000110 ea5f3f3f88ed3f3f89b93f3f3f3f3f9f5a3f3f8aea88ed3f6f4246
EUC-JP 鸚??逸??音?????飮??伎逸?oBF 111100111100000000111111001111111011000011101111001111110011111110110010101110110011111100111111001111110011111100111111110111011011101100111111001111111011010011101100101100001110111100111111011011110100001001000110 f3c03f3fb0ef3f3fb2bb3f3f3f3f3fddbb3f3fb4ecb0ef3f6f4246
UTF-8 鸚쒖눦逸녑쩂音쎌댅凉깅냵飮긷럳伎逸턭oBF 111010011011100010011010111011001001001010010110111010111000100010100110111010011000000010111000111010111000010110010001111011001010100110000010111010011001111110110011111011001000111010001100111010111000110010000101111011111010010110111001111010101011100110000101111010111000001110110101111010011010001110101110111010101011100010110111111010111001111110110011111001001011110010001110111010011000000010111000111011011000010010101101011011110100001001000110 e9b89aec9296eb88a6e980b8eb8591eca982e99fb3ec8e8ceb8c85efa5b9eab985eb83b5e9a3aeeab8b7eb9fb3e4bc8ee980b8ed84ad6f4246
UHC 鸚쒖눦逸녑쩂音쎌댅凉깅냵飮긷럳伎逸턭oBF 111001011010010010011100111011001000011110111101111011001110111110110011111001011010010010011100111010111110010110111101111011001000100010101111111001011011110010110001111010111000011010000101111010111110011010110001111001011000111010010011110100001110101111101100111011111011011001101110011011110100001001000110 e5a49cec87bdecefb3e5a49cebe5bdec88afe5bcb1eb8685ebe6b1e58e93d0ebecefb66e6f4246

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
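
As a rough illustration of what the table above computes, the following Python sketch encodes a string in each charset and prints its binary and hexadecimal bit strings. It is not the code behind this page. The function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions made for the example. Characters that a charset cannot represent are replaced with "?" (3f), which matches what the table shows for ISO-8859-1.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# This mapping is an assumption for illustration, not taken from the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("音oBF", codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("oBF", "utf-8")` returns `("011011110100001001000110", "6f4246")`, the same trailing bytes that close every row of the table.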