Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 癌??鎰??魏?? 100010101110000000111111001111111110100001001100001111110011111111101001101100000011111100111111 8ae03f3fe84c3f3fe9b03f3f
EUC-JP 癌??鎰??魏?? 101101001110001000111111001111111110111110101101001111110011111111110010101100100011111100111111 b4e23f3fefad3f3ff2b23f3f
UTF-8 癌껓퐦鎰쒐독魏귥뜓 111001111001100110001100111010101011101110010011111011011001000010100110111010011000111010110000111011001001001010010000111010111000111110000101111010011010110110001111111010101011011110100101111010111001110010010011 e7998ceabb93ed90a6e98eb0ec9290eb8f85e9ad8feab7a5eb9c93
UHC 癌껓퐦鎰쒐독魏귥뜓 111001001101111110000011111011111011110110001111111011001111000010011100111001111011010110110110111010101110000010000010111011001000110110010110 e4df83efbd8fecf09ce7b5b6eae082ec8d96
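The rows above can be approximated with a short Python sketch. It is not this page's actual implementation, which is not shown here. The mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), which is the pattern visible in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset of the table (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("癌").items():
        print(label, bits, hexa)
```

Byte-level results for a given codec may differ slightly between implementations, so the output of this sketch is not guaranteed to match the table in every case.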

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).