To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 劾???餘?? 100010100100111000111111001111110011111111101001010100000011111100111111 8a4e3f3f3fe9503f3f
EUC-JP 劾?璵?餘?? 1011001110101111001111111000111111001100111001100011111111110001101100010011111100111111 b3af3f8fcce63ff1b13f3f
UTF-8 劾숇璵뺣餘술땐 111001011000101010111110111011001000100010000111111001111001001010110101111010111011101010100011111010011010010010011000111011001000100010100000111010111001010110010000 e58abeec8887e792b5ebbaa3e9a498ec88a0eb9590
UHC 劾숇璵뺣餘술땐 1111101010110110100110011110101111100110101001011001010111101011111001101010111010111100111110101011011010101001 fab699ebe6a595ebe6aebcfab6a9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)