Which bit string is a character (or a short run of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??乳??扱苡ф? 111000011001111100111111001111111001001111111011001111110011111110001000101101011110010010001111100001001000011000111111 e19f3f3f93fb3f3f88b5e48f84863f
EUC-JP 癲??乳??扱苡ф? 111000101010000100111111001111111100011011111101001111110011111110110000101101111110011111101111101001111110011000111111 e2a13f3fc6fd3f3fb0b7e7efa7e63f
UTF-8 癲숈슜乳꿩룚扱苡ф뉩 1110011110011001101100101110110010001000100010001110110010001010100111001110010010111001101100111110101010111111101010011110101110100011100110101110011010001001101100011110100010001011101000011101000110000100111010111000100110101001 e799b2ec8888ec8a9ce4b9b3eabfa9eba39ae689b1e88ba1d184eb89a9
UHC 癲숈슜乳꿩룚扱苡ф뉩 1110111110100110100110011110110010011010101010011110101011100001101100101110011010001111100101101101000011100010111011001011111010101100111001101011010010111001 efa699ec9aa9eae1b2e68f96d0e2ecbeace6b4b9

SJIS-Win, EUC-JP: classic charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
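
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The codec names in `CHARSETS` are assumptions about which Python codecs correspond to the labels above (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_to_bits` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to match the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "\u4e73"  # one CJK character that also appears in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Running the script prints one line per charset, with the binary and hexadecimal forms side by side, in the same column order as the table.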