To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 蒻?????蹂??v蒻?????蹂??vB 11100100111010000011111100111111001111110011111100111111111001101111100000111111001111110111011011100100111010000011111100111111001111110011111100111111111001101111100000111111001111110111011001000010 e4e83f3f3f3f3fe6f83f3f76e4e83f3f3f3f3fe6f83f3f7642
EUC-JP 蒻?????蹂??v蒻?????蹂??vB 11101000111010100011111100111111001111110011111100111111111011001111101000111111001111110111011011101000111010100011111100111111001111110011111100111111111011001111101000111111001111110111011001000010 e8ea3f3f3f3f3fecfa3f3f76e8ea3f3f3f3f3fecfa3f3f7642
UTF-8 蒻멥굠璘뺡뒽蹂욏뜜v蒻멥굠璘뺡뒽蹂욏뜜vB 111010001001001010111011111010111010100110100101111010101011010110100000111011111010011110101111111010111011101010100001111010111001001010111101111010001011100110000010111011001001101010001111111010111001110010011100011101101110100010010010101110111110101110101001101001011110101010110101101000001110111110100111101011111110101110111010101000011110101110010010101111011110100010111001100000101110110010011010100011111110101110011100100111000111011001000010 e892bbeba9a5eab5a0efa7afebbaa1eb92bde8b982ec9a8feb9c9c76e892bbeba9a5eab5a0efa7afebbaa1eb92bde8b982ec9a8feb9c9c7642
UHC 蒻멥굠璘뺡뒽蹂욏뜜v蒻멥굠璘뺡뒽蹂욏뜜vB 111001011011011010111000111000111000001010001000111011001101111010010101111010011000101010110011111010111011001110011110111011011000110110011111011101101110010110110110101110001110001110000010100010001110110011011110100101011110100110001010101100111110101110110011100111101110110110001101100111110111011001000010 e5b6b8e38288ecde95e98ab3ebb39eed8d9f76e5b6b8e38288ecde95e98ab3ebb39eed8d9f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)