To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 絶??菴??爺?????熱??B 1001000011100010001111110011111111100100101111010011111100111111100101101110101000111111001111110011111100111111001111111001010001001101001111110011111101000010 90e23f3fe4bd3f3f96ea3f3f3f3f3f944d3f3f42
EUC-JP 絶??菴??爺?????熱??B 1100000011100100001111110011111111101000101111110011111100111111110011001110110000111111001111110011111100111111001111111100011110101110001111110011111101000010 c0e43f3fe8bf3f3fccec3f3f3f3f3fc7ae3f3f42
UTF-8 絶귡쓾菴든뮥爺뽳푴燎쇿궢熱겼쵔B 11100111101101011011011011101010101101111010000111101100100100111011111011101000100011111011010011101011100100111010000011101011101011101010010111100111100010001011101011101011101111011011001111101101100100011011010011101111101001111000000011101100100001111011111111101010101101101010001011100111100001101011000111101010101100101011110011101100101101011001010001000010 e7b5b6eab7a1ec93bee88fb4eb93a0ebaea5e788baebbdb3ed91b4efa780ec87bfeab6a2e786b1eab2bcecb59442
UHC 絶귡쓾菴든뮥爺뽳푴燎쇿궢熱겼쵔B 11101111101111101000001011101001100111011001100111100100111000001011010111100111100100101011000011100101101011001001011011101111101111101000001011101000111110111001100111100101100000101011010111100110111100001011000011100101101011001001011001000010 efbe82e99d99e4e0b5e792b0e5ac96efbe82e8fb99e582b5e6f0b0e5ac9642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
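
The conversion shown in the table can be approximated with a minimal Python sketch. The mapping from the charset labels to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) is an assumption, as are the helper names encode_row and encode_table; this page's converter may use a different implementation, so results for unusual characters may differ. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the 3f bytes in the table arise.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("絶B").items():
        print(label, binary, hexadecimal)
```

For example, encode_row("絶", "utf-8") yields the hexadecimal string e7b5b6, matching the first three bytes of the UTF-8 row above, while the same character in ISO-8859-1 becomes the single replacement byte 3f.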