To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 予??魏??諛?? 100101110101110000111111001111111110100110110000001111110011111111100110100001110011111100111111 975c3f3fe9b03f3fe6873f3f
EUC-JP 予??魏??諛?? 110011011011110100111111001111111111001010110010001111110011111111101011111001110011111100111111 cdbd3f3ff2b23f3febe73f3f
UTF-8 予쀬옊魏섊럤諛깅뇿 111001001011101010001000111011001000000010101100111011001001100010001010111010011010110110001111111011001000010010001010111010111001111110100100111010001010101110011011111010101011100110000101111010111000011110111111 e4ba88ec80acec988ae9ad8fec848aeb9fa4e8ab9beab985eb87bf
UHC 予쀬옊魏섊럤諛깅뇿 111001011111100010010111111011001001111010010010111010101110000010011000111001111000111010000111111010111011000010110001111010111000011110100000 e5f897ec9e92eae098e78e87ebb0b1eb87a0
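
The conversion tabulated above can be approximated in a few lines of Python. This is only a sketch, not the tool's actual implementation: it assumes that Python's built-in codecs `cp932` and `cp949` correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), as the rows above suggest. The names `CODECS`, `encode_to_bits`, and `convert` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's "cp932"
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is Python's "cp949"
}


def encode_to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_to_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("予").items():
        print(label, binary, hexa)
```

For the single character 予, this sketch yields `e4ba88` in UTF-8, `975c` in SJIS-WIN, `cdbd` in EUC-JP, and `3f` in ISO-8859-1, which agrees with the leading bytes of the corresponding rows above.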

SJIS-Win, EUC-JP: Legacy character sets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).