To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f
SJIS-WIN | 業??椅??筍щ? | 10001011110001100011111100111111100010001101011000111111001111111110001010100001100001001000101100111111 | 8bc63f3f88d63f3fe2a1848b3f
EUC-JP | 業??椅??筍щ? | 10110110110010000011111100111111101100001101100000111111001111111110010010100011101001111110101100111111 | b6c83f3fb0d83f3fe4a3a7eb3f
UTF-8 | 業볥굛椅쇗꼮筍щ쇂 | 1110011010100101101011011110101110110011101001011110101010110101100110111110011010100100100001011110110010000111100101111110101010111100101011101110011110101101100011011101000110001001111011001000011110000010 | e6a5adebb3a5eab59be6a485ec8797eabcaee7ad8dd189ec8782
UHC | 業볥굛椅쇗꼮筍щ쇂 | 111001011111011010010011111010111000001010000011111010111111010110111100111001101000010010001001111000101110110010101100111010111001100110110110 | e5f693eb8283ebf5bce68489e2ecaceb99b6

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
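
The conversion shown in the table can be approximated in a few lines of Python. This is a minimal sketch, not the tool's actual implementation. The `CODECS` mapping and the function names `encode_report` and `convert` are assumptions made for illustration. It assumes Python's `cp932` codec stands in for SJIS-Win and `cp949` for UHC; these may differ from the tool's own tables for some characters. It also assumes that characters a charset cannot represent are replaced with `?` (0x3f), which is what the table suggests.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
# cp932 for SJIS-Win and cp949 for UHC are assumptions, not taken from the tool.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_report(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("業").items():
        print(label, bits, hex_string)
```

Calling `convert` on a single character returns one (binary, hexadecimal) pair per charset, in the same column order as the table.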