To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????U 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f55
SJIS-WIN 該??偕?該??偕?該??偕?該??偕?U 1000101001011001001111110011111110011000111100010011111110001010010110010011111100111111100110001111000100111111100010100101100100111111001111111001100011110001001111111000101001011001001111110011111110011000111100010011111101010101 8a593f3f98f13f8a593f3f98f13f8a593f3f98f13f8a593f3f98f13f55
EUC-JP 該??偕?該??偕?該??偕?該??偕?U 1011001110111010001111110011111111010000111100110011111110110011101110100011111100111111110100001111001100111111101100111011101000111111001111111101000011110011001111111011001110111010001111110011111111010000111100110011111101010101 b3ba3f3fd0f33fb3ba3f3fd0f33fb3ba3f3fd0f33fb3ba3f3fd0f33f55
UTF-8 該뚲궙偕쮅該뚲궙偕쮂該뚲궙偕쭿該뚲궙偕쭵U 11101000101010011011001011101011100110101011001011101010101101101001100111100101100000011001010111101100101011101000010111101000101010011011001011101011100110101011001011101010101101101001100111100101100000011001010111101100101011101000001011101000101010011011001011101011100110101011001011101010101101101001100111100101100000011001010111101100101011011011111111101000101010011011001011101011100110101011001011101010101101101001100111100101100000011001010111101100101011011011010101010101 e8a9b2eb9ab2eab699e58195ecae85e8a9b2eb9ab2eab699e58195ecae82e8a9b2eb9ab2eab699e58195ecadbfe8a9b2eb9ab2eab699e58195ecadb555
UHC 該뚲궙偕쮅該뚲궙偕쮂該뚲궙偕쭿該뚲궙偕쭵U 1111101010110001100011001110111010000010101011101111101010100101101010000101011111111010101100011000110011101110100000101010111011111010101001011010100001010100111110101011000110001100111011101000001010101110111110101010010110101000010100011111101010110001100011001110111010000010101011101111101010100101101010000100100101010101 fab18cee82aefaa5a857fab18cee82aefaa5a854fab18cee82aefaa5a851fab18cee82aefaa5a84955

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)