What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????p????????????` 0011111100111111001111110011111100111111001111110111000000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101100000 3f3f3f3f3f3f703f3f3f3f3f3f3f3f3f3f3f3f60
SJIS-WIN 迢ク譌ヲ謐浦p迢ク譌ヲ諱ッ迢ク譌冗ィョ` 111001111000101110111000111001101001011110100110111001101000110110001001010110010111000011100111100010111011100011100110100101111010011011100110100000011010111111100111100010111011100011100110100101111000111111100111101010001010111001100000 e78bb8e697a6e68d895970e78bb8e697a6e681afe78bb8e6978fe7a8ae60
EUC-JP 迢ク譌ヲ謐浦p迢ク譌ヲ諱ッ迢ク譌冗ィョ` 1110110111101011100011101011100011101011111101111000111010100110111010111110110110110001101110100111000011101101111010111000111010111000111010111111011110001110101001101110101111100001100011101010111111101101111010111000111010111000111010111111011110111110111010011000111010101000100011101010111001100000 edeb8eb8ebf78ea6ebedb1ba70edeb8eb8ebf78ea6ebe18eafedeb8eb8ebf7bee98ea88eae60
UTF-8 迢ク譌ヲ謐浦p迢ク譌ヲ諱ッ迢ク譌冗ィョ` 1110100010111111101000101110111110111101101110001110100010101101100011001110111110111101101001101110100010101100100100001110011010110101101001100111000011101000101111111010001011101111101111011011100011101000101011011000110011101111101111011010011011101000101010111011000111101111101111011010111111101000101111111010001011101111101111011011100011101000101011011000110011100101100001101001011111101111101111011010100011101111101111011010111001100000 e8bfa2efbdb8e8ad8cefbda6e8ac90e6b5a670e8bfa2efbdb8e8ad8cefbda6e8abb1efbdafe8bfa2efbdb8e8ad8ce58697efbda8efbdae60
UHC ????謐浦p????諱????冗??` 001111110011111100111111001111111101101011001101111110001101110101110000001111110011111100111111001111111111110111001001001111110011111100111111001111111110100110110111001111110011111101100000 3f3f3f3fdacdf8dd703f3f3f3ffdc93f3f3f3fe9b73f3f60

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
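
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical. The mapping assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 and UHC rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


if __name__ == "__main__":
    sample = "\u3042p"  # hiragana "a" followed by ASCII "p"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.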