To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 堯???③?耀??踰?????^ 1110101010011111001111110011111100111111100001110100001000111111100101110111001100111111001111111110011011111010001111110011111100111111001111110011111101011110 ea9f3f3f3f87423f97733f3fe6fa3f3f3f3f3f5e
EUC-JP 堯?????耀??踰?????^ 11110100101000010011111100111111001111110011111100111111110011011101010000111111001111111110110011111100001111110011111100111111001111110011111101011110 f4a13f3f3f3f3fcdd43f3fecfc3f3f3f3f3f5e
UTF-8 堯붾쾯戮③젇耀믧뜑踰녿쉔療딅탲^ 11100101101000001010111111101011101101101011111011101100101111101010111111101111101001111001001011100010100100011010001011101100101000001000011111101000100000001000000011101011101011111010011111101011100111001001000111101000101110001011000011101011100001011011111111101100100010011001010011101111101001111000000111101011100101001000010111101101100000111011001001011110 e5a0afebb6beecbeafefa792e291a2eca087e88080ebafa7eb9c91e8b8b0eb85bfec8994efa781eb9485ed83b25e
UHC 堯붾쾯戮③젇耀믧뜑踰녿쉔療딅탲^ 11101000111010111001010011101011101100101000011011101011101111011010100011101001101000001000101011101001101001011001001011101001100011011001010011101011101100101000011011101011101111011010100011101000111111101000101011101011101101011000111101011110 e8eb94ebb286ebbda8e9a08ae9a592e98d94ebb286ebbda8e8fe8aebb58f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)