To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鸚??泣??誘レ?^ 1110101001011111001111110011111110001011100000110011111100111111100101110101010110000011100011000011111101011110 ea5f3f3f8b833f3f9755838c3f5e
EUC-JP 鸚??泣??誘レ?^ 1111001111000000001111110011111110110101111000110011111100111111110011011011011010100101111011000011111101011110 f3c03f3fb5e33f3fcdb6a5ec3f5e
UTF-8 鸚룹꼹泣딉㏊誘レ뒅^ 11101001101110001001101011101011101000111011100111101010101111001011100111100110101100111010001111101011100101001000100111100011100011111000101011101000101010101001100011100011100000111010110011101011100100101000010101011110 e9b89aeba3b9eabcb9e6b3a3eb9489e38f8ae8aa98e383aceb92855e
UHC 鸚룹꼹泣딉㏊誘レ뒅^ 11100101101001001011011111101100100001001001000111101011111010001000101011101111101001111011010111101011101011111010101111101100100010101000001101011110 e5a4b7ec8491ebe88aefa7b5ebafabec8a835e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)