Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???靭?ぜ飮??歪?9幼?????? 00111111001111110011111110010000011110000011111110000010101110101001111101011010001111110011111110011000011000110011111110000010010110001001011101100011001111110011111100111111001111110011111100111111 3f3f3f90783f82ba9f5a3f3f98633f825897633f3f3f3f3f3f
EUC-JP ???靭?ぜ飮??歪?9幼??洧??孼 0011111100111111001111111011111111011001001111111010010010111100110111011011101100111111001111111100111111000100001111111010001110111001110011011100010000111111001111111000111111000111101101000011111100111111100011111011101011000011 3f3f3fbfd93fa4bcddbb3f3fcfc43fa3b9cdc43f3f8fc7b43f3f8fbac3
UTF-8 麗몃쓷靭딂ぜ飮낇꼤歪묐9幼끾갭洧뷀뜙孼 111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001010010000010111000111000000110011100111010011010001110101110111010111000001010000111111010101011110010100100111001101010110110101010111010111010110010010000111011111011110010011001111001011011100110111100111010111000000110111110111010101011000010101101111001101011010010100111111010111011011110000000111010111001110010011001111001011010110110111100 efa688ebaa83ec93b7e99dadeb9482e3819ce9a3aeeb8287eabca4e6adaaebac90efbc99e5b9bceb81beeab0ade6b4a7ebb780eb9c99e5adbc
UHC 麗몃쓷靭딂ぜ飮낇꼤歪묐9幼끾갭洧뷀뜙孼 1110011010110000101110001110101110011101100101001110110011100101100010101110100010101010101111001110101111100110100001011110110110000100100000011110100011100000100100011110101110100011101110011110101011101010100001011110011010110000101110001110101011111011100101001110110110001101100111001110010111101101 e6b0b8eb9d94ece58ae8aabcebe685ed8481e8e091eba3b9eaea85e6b0b8eafb94ed8d9ce5ed

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
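
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the table's labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well; results for some characters may still differ from the table, since codec mapping tables vary between implementations.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("ぜ"):
        print(label, bits, hexstr)
```

For the single character "ぜ", which also occurs in the sample input above, this gives 82ba for SJIS-WIN, a4bc for EUC-JP, and e3819c for UTF-8, matching the corresponding bytes in the table.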