To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ??〕揖??邑?? 00111111001111111000000101101100100101110100101100111111001111111001011101010111001111111000000101001000 3f3f816c974b3f3f97573f8148
EUC-JP 艅?〕揖??邑?? 100011111101011011111101001111111010000111001101110011011010110000111111001111111100110110111000001111111010000110101001 8fd6fd3fa1cdcdac3f3fcdb83fa1a9
UTF-8 艅덈〕揖쇤댖邑㏓? 111010001000100110000101111010111000110110001000111000111000000010010101111001101000111110010110111011001000011110100100111010111000110010010110111010011000001010010001111000111000111110010011111011111011110010011111 e88985eb8d88e38095e68f96ec87a4eb8c96e98291e38f93efbc9f
UHC 艅덈〕揖쇤댖邑㏓? 111001101010100110001000111010111010000110110011111010111110011110111100111010011000100010111010111010111110100110100111111010111010001110111111 e6a988eba1b3ebe7bce988baebe9a7eba3bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)