To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????~K 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111111001001011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7e4b
SJIS-WIN 迢ク譌ヲ謐烏迢ク譌ヲ謐宇迢ク譌ヲ諱ッ邯サ~K 11100111100010111011100011100110100101111010011011100110100011011000100101000111111001111000101110111000111001101001011110100110111001101000110110001001010001101110011110001011101110001110011010010111101001101110011010000001101011111110011110110110101110110111111001001011 e78bb8e697a6e68d8947e78bb8e697a6e68d8946e78bb8e697a6e681afe7b6bb7e4b
EUC-JP 迢ク譌ヲ謐烏迢ク譌ヲ謐宇迢ク譌ヲ諱ッ邯サ~K 111011011110101110001110101110001110101111110111100011101010011011101011111011011011000110101000111011011110101110001110101110001110101111110111100011101010011011101011111011011011000110100111111011011110101110001110101110001110101111110111100011101010011011101011111000011000111010101111111011101011100010001110101110110111111001001011 edeb8eb8ebf78ea6ebedb1a8edeb8eb8ebf78ea6ebedb1a7edeb8eb8ebf78ea6ebe18eafeeb88ebb7e4b
UTF-8 迢ク譌ヲ謐烏迢ク譌ヲ謐宇迢ク譌ヲ諱ッ邯サ~K 1110100010111111101000101110111110111101101110001110100010101101100011001110111110111101101001101110100010101100100100001110011110000011100011111110100010111111101000101110111110111101101110001110100010101101100011001110111110111101101001101110100010101100100100001110010110101110100001111110100010111111101000101110111110111101101110001110100010101101100011001110111110111101101001101110100010101011101100011110111110111101101011111110100110000010101011111110111110111101101110110111111001001011 e8bfa2efbdb8e8ad8cefbda6e8ac90e7838fe8bfa2efbdb8e8ad8cefbda6e8ac90e5ae87e8bfa2efbdb8e8ad8cefbda6e8abb1efbdafe982afefbdbb7e4b
UHC ????謐烏????謐宇????諱?邯?~K 00111111001111110011111100111111110110101100110111101000101000010011111100111111001111110011111111011010110011011110100111010100001111110011111100111111001111111111110111001001001111111100101011111011001111110111111001001011 3f3f3f3fdacde8a13f3f3f3fdacde9d43f3f3f3ffdc93fcafb3f7e4b

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
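
The conversion in the table can be approximated offline with a few lines of Python. This is only a sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to be what the table shows for ISO-8859-1 and UHC.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("~K").items():
        print(label, bits, hexa)
```

For the ASCII-only input `~K`, every charset yields the same two bytes, 7e 4b, which matches the tail of each row in the table above.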