To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜???〓?瑤??唯??濡〓?吾 111000011001111100111111001111111000101101011000001111110011111100111111100000011010110000111111111010101010001000111111001111111001011101000010001111110011111110010100010001111000000110101100001111111000110011100001 e19f3f3f8b583f3f3f81ac3feaa23f3f97423f3f944781ac3f8ce1
EUC-JP 癲??宜???〓?瑤??唯??濡〓?吾 111000101010000100111111001111111011010110111001001111110011111100111111101000101010111000111111111101001010010000111111001111111100110110100011001111110011111111000111101010001010001010101110001111111011100011100011 e2a13f3fb5b93f3f3fa2ae3ff4a43f3fcda33f3fc7a8a2ae3fb8e3
UTF-8 癲덈챶宜밧래類〓젧瑤뗭슜唯㏝뼸濡〓쳜吾 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111011000010100111111010111001111010011000111011111010011110010000111000111000000010010011111011001010000010100111111001111001000110100100111010111001011110101101111011001000101010011100111001011001010010101111111000111000111110011101111010111011110010111000111001101011111110100001111000111000000010010011111011001011001110011100111001011001000010111110 e799b2eb8d88ecb1b6e5ae9cebb0a7eb9e98efa790e38093eca0a7e791a4eb97adec8a9ce594afe38f9debbcb8e6bfa1e38093ecb39ce590be
UHC 癲덈챶宜밧래類〓젧瑤뗭슜唯㏝뼸濡〓쳜吾 1110111110100110100010001110101110101010100000111110101111110001101110011110010110110111101000011110101110111010101000011110101110100000100111111110100011111101100010111110110010011010101010011110101011100110101001111110100110010110101110111110101110100001101000011110101110101011100000101110011111101110 efa688ebaa83ebf1b9e5b7a1ebbaa1eba09fe8fd8bec9aa9eae6a7e996bbeba1a1ebab82e7ee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)