To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?臣ゅ?臣た??ぉ?臣ゅ?臣た??ぉB 0011111110010000011000101000001011100011001111111001000001100010100000101011110100111111001111111000001010100111001111111001000001100010100000101110001100111111100100000110001010000010101111010011111100111111100000101010011101000010 3f906282e33f906282bd3f3f82a73f906282e33f906282bd3f3f82a742
EUC-JP ?臣ゅ?臣た??ぉ?臣ゅ?臣た??ぉB 0011111110111111110000111010010011100101001111111011111111000011101001001011111100111111001111111010010010101001001111111011111111000011101001001110010100111111101111111100001110100100101111110011111100111111101001001010100101000010 3fbfc3a4e53fbfc3a4bf3f3fa4a93fbfc3a4e53fbfc3a4bf3f3fa4a942
UTF-8 룶臣ゅ룶臣た룵殺ぉ룶臣ゅ룶臣た룵殺ぉB 11101011101000111011011011101000100001111010001111100011100000101000010111101011101000111011011011101000100001111010001111100011100000011001111111101011101000111011010111101111101001011011000011100011100000011000100111101011101000111011011011101000100001111010001111100011100000101000010111101011101000111011011011101000100001111010001111100011100000011001111111101011101000111011010111101111101001011011000011100011100000011000100101000010 eba3b6e887a3e38285eba3b6e887a3e3819feba3b5efa5b0e38189eba3b6e887a3e38285eba3b6e887a3e3819feba3b5efa5b0e3818942
UHC 룶臣ゅ룶臣た룵殺ぉ룶臣ゅ룶臣た룵殺ぉB 10001111101010111110001111101101101010101110010110001111101010111110001111101101101010101011111110001111101010101110000111101101101010101010100110001111101010111110001111101101101010101110010110001111101010111110001111101101101010101011111110001111101010101110000111101101101010101010100101000010 8fabe3edaae58fabe3edaabf8faae1edaaa98fabe3edaae58fabe3edaabf8faae1edaaa942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)