To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
SJIS-WIN 迢ク譌ヲ逾幽迢ク譌ヲ逾由迢ク譌ヲ逾輸gB 1110011110001011101110001110011010010111101001101110011110100101100101110100100011100111100010111011100011100110100101111010011011100111101001011001011101010010111001111000101110111000111001101001011110100110111001111010010110010111010000010110011101000010 e78bb8e697a6e7a59748e78bb8e697a6e7a59752e78bb8e697a6e7a597416742
EUC-JP 迢ク譌ヲ逾幽迢ク譌ヲ逾由迢ク譌ヲ逾輸gB 1110110111101011100011101011100011101011111101111000111010100110111011101010011111001101101010011110110111101011100011101011100011101011111101111000111010100110111011101010011111001101101100111110110111101011100011101011100011101011111101111000111010100110111011101010011111001101101000100110011101000010 edeb8eb8ebf78ea6eea7cda9edeb8eb8ebf78ea6eea7cdb3edeb8eb8ebf78ea6eea7cda26742
UTF-8 迢ク譌ヲ逾幽迢ク譌ヲ逾由迢ク譌ヲ逾輸gB 1110100010111111101000101110111110111101101110001110100010101101100011001110111110111101101001101110100110000000101111101110010110111001101111011110100010111111101000101110111110111101101110001110100010101101100011001110111110111101101001101110100110000000101111101110011110010100101100011110100010111111101000101110111110111101101110001110100010101101100011001110111110111101101001101110100110000000101111101110100010111100101110000110011101000010 e8bfa2efbdb8e8ad8cefbda6e980bee5b9bde8bfa2efbdb8e8ad8cefbda6e980bee794b1e8bfa2efbdb8e8ad8cefbda6e980bee8bcb86742
UHC ????逾幽????逾由????逾輸gB 0011111100111111001111110011111111101011101101011110101011101011001111110011111100111111001111111110101110110101111010111010011000111111001111110011111100111111111010111011010111100010110000110110011101000010 3f3f3f3febb5eaeb3f3f3f3febb5eba63f3f3f3febb5e2c36742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)