To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 沃??餓?ぐ娃??譽??蘖?ぐ娃??沃??擁?? 10010111100000000011111100111111100010011110110000111111100000101010111010001000101000010011111100111111111001101010001100111111001111111001111101010000001111111000001010101110100010001010000100111111001111111001011110000000001111110011111110010111011010010011111100111111 97803f3f89ec3f82ae88a13f3fe6a33f3f9f503f82ae88a13f3f97803f3f97693f3f
EUC-JP 沃??餓?ぐ娃??譽??蘖?ぐ娃??沃??擁?? 11001101111000000011111100111111101100101110111000111111101001001011000010110000101000110011111100111111111011001010010100111111001111111101110110110001001111111010010010110000101100001010001100111111001111111100110111100000001111110011111111001101110010100011111100111111 cde03f3fb2ee3fa4b0b0a33f3feca53f3fddb13fa4b0b0a33f3fcde03f3fcdca3f3f
UTF-8 沃곈걶餓뽬ぐ娃쒏릍譽길뿈蘖띹ぐ娃쒍퍟沃곈걶擁녑렘 111001101011001010000011111010101011001110001000111010101011000110110110111010011010010010010011111010111011110110101100111000111000000110010000111001011010100010000011111011001001001010001111111010111010011010001101111010001010110110111101111010101011100010111000111010111011111110001000111010001001100010010110111010111001110110111001111000111000000110010000111001011010100010000011111011001001001010001101111011011000110110011111111001101011001010000011111010101011001110001000111010101011000110110110111001101001001110000001111010111000010110010001111010111010000010011000 e6b283eab388eab1b6e9a493ebbdace38190e5a883ec928feba68de8adbdeab8b8ebbf88e89896eb9db9e38190e5a883ec928ded8d9fe6b283eab388eab1b6e69381eb8591eba098
UHC 沃곈걶餓뽬ぐ娃쒏릍譽길뿈蘖띹ぐ娃쒍퍟沃곈걶擁녑렘 111010001010101010110000111010011000000110011100111001001011101110010110111010001010101010110000111010001101111110011100111001101011100010101100111001111110001010110001111001101001011110001111111001011110111010001101111010001010101010110000111010001101111110011100111001001011101110010110111010001010101010110000111010011000000110011100111010001011011010110011111001011011011110111101 e8aab0e9819ce4bb96e8aab0e8df9ce6b8ace7e2b1e6978fe5ee8de8aab0e8df9ce4bb96e8aab0e9819ce8b6b3e5b7bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)