To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 億??二??怨?? 100010011010110100111111001111111001001111110001001111110011111110001001100001010011111100111111 89ad3f3f93f13f3f89853f3f
EUC-JP 億??二??怨?? 101100101010111100111111001111111100011011110011001111110011111110110001111001010011111100111111 b2af3f3fc6f33f3fb1e53f3f
UTF-8 億륁궠二긷슫怨멸틕 111001011000010010000100111010111010010110000001111010101011011010100000111001001011101010001100111010101011100010110111111011001000101010101011111001101000000010101000111010111010100110111000111011011000101110010101 e58484eba581eab6a0e4ba8ceab8b7ec8aabe680a8eba9b8ed8b95
UHC 億륁궠二긷슫怨멸틕 111001011110001010001111111011001000001010110011111011001010001110110001111001011001101010110100111010101011001110111000111010101011101010000011 e5e28fec82b3eca3b1e59ab4eab3b8eaba83

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)