To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????q????????????O 00111111001111110011111100111111001111110011111100111111001111110111000100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001111 3f3f3f3f3f3f3f3f713f3f3f3f3f3f3f3f3f3f3f3f4f
SJIS-WIN 迢ク譌ヲ諱ッ邯サq迢ク譌ヲ謐韻迢ク譌ヲ謐隠O 11100111100010111011100011100110100101111010011011100110100000011010111111100111101101101011101101110001111001111000101110111000111001101001011110100110111001101000110110001001010000111110011110001011101110001110011010010111101001101110011010001101100010010100001001001111 e78bb8e697a6e681afe7b6bb71e78bb8e697a6e68d8943e78bb8e697a6e68d89424f
EUC-JP 迢ク譌ヲ諱ッ邯サq迢ク譌ヲ謐韻迢ク譌ヲ謐隠O 111011011110101110001110101110001110101111110111100011101010011011101011111000011000111010101111111011101011100010001110101110110111000111101101111010111000111010111000111010111111011110001110101001101110101111101101101100011010010011101101111010111000111010111000111010111111011110001110101001101110101111101101101100011010001101001111 edeb8eb8ebf78ea6ebe18eafeeb88ebb71edeb8eb8ebf78ea6ebedb1a4edeb8eb8ebf78ea6ebedb1a34f
UTF-8 迢ク譌ヲ諱ッ邯サq迢ク譌ヲ謐韻迢ク譌ヲ謐隠O 1110100010111111101000101110111110111101101110001110100010101101100011001110111110111101101001101110100010101011101100011110111110111101101011111110100110000010101011111110111110111101101110110111000111101000101111111010001011101111101111011011100011101000101011011000110011101111101111011010011011101000101011001001000011101001100111111011101111101000101111111010001011101111101111011011100011101000101011011000110011101111101111011010011011101000101011001001000011101001100110101010000001001111 e8bfa2efbdb8e8ad8cefbda6e8abb1efbdafe982afefbdbb71e8bfa2efbdb8e8ad8cefbda6e8ac90e99fbbe8bfa2efbdb8e8ad8cefbda6e8ac90e99aa04f
UHC ????諱?邯?q????謐韻????謐?O 001111110011111100111111001111111111110111001001001111111100101011111011001111110111000100111111001111110011111100111111110110101100110111101010101001000011111100111111001111110011111111011010110011010011111101001111 3f3f3f3ffdc93fcafb3f713f3f3f3fdacdeaa43f3f3f3fdacd3f4f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)