To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 闇μ?鎰??飮?? 10001000110001011000001111001010001111111110100001001100001111110011111110011111010110100011111100111111 88c583ca3fe84c3f3f9f5a3f3f
EUC-JP 闇μ?鎰??飮?? 10110000110001111010011011001100001111111110111110101101001111110011111111011101101110110011111100111111 b0c7a6cc3fefad3f3fddbb3f3f
UTF-8 闇μ슱鎰쇿쮦飮딅걙 1110100110010111100001111100111010111100111011001000101010110001111010011000111010110000111011001000011110111111111011001010111010100110111010011010001110101110111010111001010010000101111010101011000110011001 e99787cebcec8ab1e98eb0ec87bfecaea6e9a3aeeb9485eab199
UHC 闇μ슱鎰쇿쮦飮딅걙 111001001110000110100101111011001001101010111000111011001111000010011001111001011010100010000011111010111110011010001010111010111000000110000011 e4e1a5ec9ab8ecf099e5a883ebe68aeb8183

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)