What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ??????懿??v??????懿??vB 0011111100111111001111110011111100111111001111111001110011110010001111110011111101110110001111110011111100111111001111110011111100111111100111001111001000111111001111110111011001000010 3f3f3f3f3f3f9cf23f3f763f3f3f3f3f3f9cf23f3f7642
EUC-JP ???孼??懿??v???孼??懿??vB 001111110011111100111111100011111011101011000011001111110011111111011000111101000011111100111111011101100011111100111111001111111000111110111010110000110011111100111111110110001111010000111111001111110111011001000010 3f3f3f8fbac33f3fd8f43f3f763f3f3f8fbac33f3fd8f43f3f7642
UTF-8 溜깅젡孼꾩꺎懿깅졑v溜깅젡孼꾩꺎懿깅졑vB 111011111010011110001011111010101011100110000101111011001010000010100001111001011010110110111100111010101011111010101001111010101011101010001110111001101000011110111111111010101011100110000101111011001010000110010001011101101110111110100111100010111110101010111001100001011110110010100000101000011110010110101101101111001110101010111110101010011110101010111010100011101110011010000111101111111110101010111001100001011110110010100001100100010111011001000010 efa78beab985eca0a1e5adbceabea9eaba8ee687bfeab985eca19176efa78beab985eca0a1e5adbceabea9eaba8ee687bfeab985eca1917642
UHC 溜깅젡孼꾩꺎懿깅졑v溜깅젡孼꾩꺎懿깅졑vB 111010101111111010110001111010111010000010011010111001011110110110000100111011001000001110110100111010111111001110110001111010111010000010111110011101101110101011111110101100011110101110100000100110101110010111101101100001001110110010000011101101001110101111110011101100011110101110100000101111100111011001000010 eafeb1eba09ae5ed84ec83b4ebf3b1eba0be76eafeb1eba09ae5ed84ec83b4ebf3b1eba0be7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
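
The conversion shown in the table above can be approximated with a short Python sketch. This is not the page's own implementation. The `CODECS` mapping from the table's charset labels to Python codec names is an assumption (for example, `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?`, which seems to be where the `3f` bytes in the table come from.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are approximations; the original tool may use different tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Running it on a single character such as `あ` prints one row per charset, in the same column order as the table: charset, binary bit string, hexadecimal bit string.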