To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 澱???全?才?殿?靖???禎?愴應? 100100110110001000111111001111110011111110010001010100110011111110001101110010110011111110010011011000010011111110010110111101010011111100111111001111111001001011110101001111111001110011000110100111001110010000111111 93623f3f3f91533f8dcb3f93613f96f53f3f3f92f53f9cc69ce43f
EUC-JP 澱???全?才?殿?靖???禎?愴應? 110001011100001100111111001111110011111111000001101101000011111110111010110011010011111111000101110000100011111111001100111101110011111100111111001111111100010011110111001111111101100011001000110110001110011000111111 c5c33f3f3fc1b43fbacd3fc5c23fccf73f3f3fc4f73fd8c8d8e63f
UTF-8 澱ㆁ렰렕全렖才렯殿렔靖ㆁ렰렕禎꿩愴應렩 111001101011111010110001111000111000011010000001111010111010000010110000111010111010000010010101111001011000010110101000111010111010000010010110111001101000100110001101111010111010000010101111111001101010111010111111111010111010000010010100111010011001110110010110111000111000011010000001111010111010000010110000111010111010000010010101111001111010011010001110111010101011111110101001111001101000010010110100111001101000011110001001111010111010000010101001 e6beb1e38681eba0b0eba095e585a8eba096e6898deba0afe6aebfeba094e99d96e38681eba0b0eba095e7a68eeabfa9e684b4e68789eba0a9
UHC 澱ㆁ렰렕全렖才렯殿렔靖ㆁ렰렕禎꿩愴應렩 1110111011111110101001001111000110001110101111011000111010101010111011101110111110001110101010111110111010100110100011101011110011101110111111001000111010101001111011111111111010100100111100011000111010111101100011101010101011101111111011101011001011100110111100111110000111101011111010111000111010110111 eefea4f18ebd8eaaeeef8eabeea68ebceefc8ea9effea4f18ebd8eaaefeeb2e6f3e1ebeb8eb7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)