To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 禎?刀遵?R禎?刀遵?^[禎?刀遵?R禎?刀遵?^[^ 100100101111010100111111100100111000000110001111100001010011111101010010100100101111010100111111100100111000000110001111100001010011111101011110010110111001001011110101001111111001001110000001100011111000010100111111010100101001001011110101001111111001001110000001100011111000010100111111010111100101101101011110 92f53f93818f853f5292f53f93818f853f5e5b92f53f93818f853f5292f53f93818f853f5e5b5e
EUC-JP 禎?刀遵?R禎?刀遵?^[禎?刀遵?R禎?刀遵?^[^ 110001001111011100111111110001011110000110111101111001010011111101010010110001001111011100111111110001011110000110111101111001010011111101011110010110111100010011110111001111111100010111100001101111011110010100111111010100101100010011110111001111111100010111100001101111011110010100111111010111100101101101011110 c4f73fc5e1bde53f52c4f73fc5e1bde53f5e5bc4f73fc5e1bde53f52c4f73fc5e1bde53f5e5b5e
UTF-8 禎대刀遵둑R禎대刀遵둑^[禎대刀遵둑R禎대刀遵둑^[^ 11100111101001101000111011101011100011001000000011100101100010001000000011101001100000011011010111101011100100011001000101010010111001111010011010001110111010111000110010000000111001011000100010000000111010011000000110110101111010111001000110010001010111100101101111100111101001101000111011101011100011001000000011100101100010001000000011101001100000011011010111101011100100011001000101010010111001111010011010001110111010111000110010000000111001011000100010000000111010011000000110110101111010111001000110010001010111100101101101011110 e7a68eeb8c80e58880e981b5eb919152e7a68eeb8c80e58880e981b5eb91915e5be7a68eeb8c80e58880e981b5eb919152e7a68eeb8c80e58880e981b5eb91915e5b5e
UHC 禎대刀遵둑R禎대刀遵둑^[禎대刀遵둑R禎대刀遵둑^[^ 1110111111101110101101001110101111010011111011111111000111100101101101011100111101010010111011111110111010110100111010111101001111101111111100011110010110110101110011110101111001011011111011111110111010110100111010111101001111101111111100011110010110110101110011110101001011101111111011101011010011101011110100111110111111110001111001011011010111001111010111100101101101011110 efeeb4ebd3eff1e5b5cf52efeeb4ebd3eff1e5b5cf5e5befeeb4ebd3eff1e5b5cf52efeeb4ebd3eff1e5b5cf5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)