What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 癰????R癰????^[癰????R癰????^[^ 11100001100111100011111100111111001111110011111101010010111000011001111000111111001111110011111100111111010111100101101111100001100111100011111100111111001111110011111101010010111000011001111000111111001111110011111100111111010111100101101101011110 e19e3f3f3f3f52e19e3f3f3f3f5e5be19e3f3f3f3f52e19e3f3f3f3f5e5b5e
EUC-JP 癰????R癰????^[癰????R癰????^[^ 11100001111111100011111100111111001111110011111101010010111000011111111000111111001111110011111100111111010111100101101111100001111111100011111100111111001111110011111101010010111000011111111000111111001111110011111100111111010111100101101101011110 e1fe3f3f3f3f52e1fe3f3f3f3f5e5be1fe3f3f3f3f52e1fe3f3f3f3f5e5b5e
UTF-8 癰앓롉앓롉R癰앓롉앓롉^[癰앓롉앓롉R癰앓롉앓롉^[^ 11100111100110011011000011101100100101011001001111101011101000011000100111101100100101011001001111101011101000011000100101010010111001111001100110110000111011001001010110010011111010111010000110001001111011001001010110010011111010111010000110001001010111100101101111100111100110011011000011101100100101011001001111101011101000011000100111101100100101011001001111101011101000011000100101010010111001111001100110110000111011001001010110010011111010111010000110001001111011001001010110010011111010111010000110001001010111100101101101011110 e799b0ec9593eba189ec9593eba18952e799b0ec9593eba189ec9593eba1895e5be799b0ec9593eba189ec9593eba18952e799b0ec9593eba189ec9593eba1895e5b5e
UHC 癰앓롉앓롉R癰앓롉앓롉^[癰앓롉앓롉R癰앓롉앓롉^[^ 1110100010111001101111101100111010001110110011111011111011001110100011101100111101010010111010001011100110111110110011101000111011001111101111101100111010001110110011110101111001011011111010001011100110111110110011101000111011001111101111101100111010001110110011110101001011101000101110011011111011001110100011101100111110111110110011101000111011001111010111100101101101011110 e8b9bece8ecfbece8ecf52e8b9bece8ecfbece8ecf5e5be8b9bece8ecfbece8ecf52e8b9bece8ecfbece8ecf5e5b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
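
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table above.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
# SJIS-WIN = CP932 is stated on the page; UHC -> cp949 is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

For example, encoding 癰 as UTF-8 with this sketch yields `e799b0`, the same leading bytes as in the UTF-8 row above. Results for the other charsets depend on each codec's mapping tables and may differ slightly from the page in edge cases.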