To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 丹俗存辿損旦棚属 10010010010011111001000110101101100100011011011010010010010010001001000110111001100100100101010110010010010010011001000110101110 924f91ad91b6924891b99255924991ae
EUC-JP 丹俗存辿損旦棚属 11000011101100001100001010101111110000101011100011000011101010011100001010111011110000111011011011000011101010101100001010110000 c3b0c2afc2b8c3a9c2bbc3b6c3aac2b0
UTF-8 丹俗存辿損旦棚属 111001001011100010111001111001001011111110010111111001011010110110011000111010001011111010111111111001101001000010001101111001101001011110100110111001101010001110011010111001011011000110011110 e4b8b9e4bf97e5ad98e8bebfe6908de697a6e6a39ae5b19e
UHC 丹俗存?損旦棚? 1101001110100001111000011101010011110000111011010011111111100001110111111101001110101001110111011101110000111111 d3a1e1d4f0ed3fe1dfd3a9dddc3f

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
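
The conversion in the table can be approximated with a short Python script. This is only a sketch, not the code behind this page: the function names (`encode_row`, `convert`) are hypothetical, and the mapping from the charset labels to Python codec names (in particular UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (characters as stored, binary bit string, hex string) for one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec)
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Encode text in every codec listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexstr) in convert("丹俗存辿損旦棚属").items():
        print(name, shown, bits, hexstr)
```

Each byte is formatted as eight binary digits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.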