To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲??幽??魏??^ 11100001100111110011111100111111100101110100100000111111001111111110100110110000001111110011111101011110 e19f3f3f97483f3fe9b03f3f5e
EUC-JP 癲??幽??魏??^ 11100010101000010011111100111111110011011010100100111111001111111111001010110010001111110011111101011110 e2a13f3fcda93f3ff2b23f3f5e
UTF-8 癲뉖굞幽곤쫯魏녿뜲^ 11100111100110011011001011101011100010011001011011101010101101011001111011100101101110011011110111101010101100111010010011101100101010111010111111101001101011011000111111101011100001011011111111101011100111001011001001011110 e799b2eb8996eab59ee5b9bdeab3a4ecabafe9ad8feb85bfeb9cb25e
UHC 癲뉖굞幽곤쫯魏녿뜲^ 11101111101001101000011111101011100000101000011011101010111010111011000011101111101001101000011111101010111000001000011011101011100011011011000001011110 efa687eb8286eaebb0efa687eae086eb8db05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)