To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 鶯??誼??筍ъ????鶯??誼??筍ъ????E 111010011111001000111111001111111000101101100010001111110011111111100010101000011000010010001100001111110011111100111111001111111110100111110010001111110011111110001011011000100011111100111111111000101010000110000100100011000011111100111111001111110011111101000101 e9f23f3f8b623f3fe2a1848c3f3f3f3fe9f23f3f8b623f3fe2a1848c3f3f3f3f45
EUC-JP 鶯??誼??筍ъ?倻??鶯??誼??筍ъ?倻??E 11110010111101000011111100111111101101011100001100111111001111111110010010100011101001111110110000111111100011111011000111110110001111110011111111110010111101000011111100111111101101011100001100111111001111111110010010100011101001111110110000111111100011111011000111110110001111110011111101000101 f2f43f3fb5c33f3fe4a3a7ec3f8fb1f63f3ff2f43f3fb5c33f3fe4a3a7ec3f8fb1f63f3f45
UTF-8 鶯ㅳ꺀誼뷸뵺筍ъ쪎倻딅넽鶯ㅳ꺀誼뷸뵺筍ъ쪎倻딅녂E 1110100110110110101011111110001110000101101100111110101010111010100000001110100010101010101111001110101110110111101110001110101110110101101110101110011110101101100011011101000110001010111011001010101010001110111001011000000010111011111010111001010010000101111010111000010010111101111010011011011010101111111000111000010110110011111010101011101010000000111010001010101010111100111010111011011110111000111010111011010110111010111001111010110110001101110100011000101011101100101010101000111011100101100000001011101111101011100101001000010111101011100001011000001001000101 e9b6afe385b3eaba80e8aabcebb7b8ebb5bae7ad8dd18aecaa8ee580bbeb9485eb84bde9b6afe385b3eaba80e8aabcebb7b8ebb5bae7ad8dd18aecaa8ee580bbeb9485eb858245
UHC 鶯ㅳ꺀誼뷸뵺筍ъ쪎倻딅넽鶯ㅳ꺀誼뷸뵺筍ъ쪎倻딅녂E 11100101101000111010010011100011100000111010100111101011111111101011101011100110100101001011100011100010111011001010110011101100101001011000100011100101101001101000101011101011100001101011011111100101101000111010010011100011100000111010100111101011111111101011101011100110100101001011100011100010111011001010110011101100101001011000100011100101101001101000101011101011100001101011101001000101 e5a3a4e383a9ebfebae694b8e2ecaceca588e5a68aeb86b7e5a3a4e383a9ebfebae694b8e2ecaceca588e5a68aeb86ba45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)