To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 餓?ⅹ沃?ぐ譽??[餓?ⅹ沃?ぐ譽??[^ 10001001111011000011111111111010010010011001011110000000001111111000001010101110111001101010001100111111001111110101101110001001111011000011111111111010010010011001011110000000001111111000001010101110111001101010001100111111001111110101101101011110 89ec3ffa4997803f82aee6a33f3f5b89ec3ffa4997803f82aee6a33f3f5b5e
EUC-JP 餓??沃?ぐ譽??[餓??沃?ぐ譽??[^ 1011001011101110001111110011111111001101111000000011111110100100101100001110110010100101001111110011111101011011101100101110111000111111001111111100110111100000001111111010010010110000111011001010010100111111001111110101101101011110 b2ee3f3fcde03fa4b0eca53f3f5bb2ee3f3fcde03fa4b0eca53f3f5b5e
UTF-8 餓뽩ⅹ沃계ぐ譽길짎[餓뽩ⅹ沃계ぐ譽길짎[^ 111010011010010010010011111010111011110110101001111000101000010110111001111001101011001010000011111010101011001110000100111000111000000110010000111010001010110110111101111010101011100010111000111011001010011110001110010110111110100110100100100100111110101110111101101010011110001010000101101110011110011010110010100000111110101010110011100001001110001110000001100100001110100010101101101111011110101010111000101110001110110010100111100011100101101101011110 e9a493ebbda9e285b9e6b283eab384e38190e8adbdeab8b8eca78e5be9a493ebbda9e285b9e6b283eab384e38190e8adbdeab8b8eca78e5b5e
UHC 餓뽩ⅹ沃계ぐ譽길짎[餓뽩ⅹ沃계ぐ譽길짎[^ 111001001011101110010110111001011010010110101010111010001010101010110000111010001010101010110000111001111110001010110001111001101010001110011010010110111110010010111011100101101110010110100101101010101110100010101010101100001110100010101010101100001110011111100010101100011110011010100011100110100101101101011110 e4bb96e5a5aae8aab0e8aab0e7e2b1e6a39a5be4bb96e5a5aae8aab0e8aab0e7e2b1e6a39a5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)