To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 谷短遜狸則村谷短遜狸則村B 10010010010010101001001001011010100100011011101110010010010010111001000110100101100100011011101010010010010010101001001001011010100100011011101110010010010010111001000110100101100100011011101001000010 924a925a91bb924b91a591ba924a925a91bb924b91a591ba42
EUC-JP 谷短遜狸則村谷短遜狸則村B 11000011101010111100001110111011110000101011110111000011101011001100001010100111110000101011110011000011101010111100001110111011110000101011110111000011101011001100001010100111110000101011110001000010 c3abc3bbc2bdc3acc2a7c2bcc3abc3bbc2bdc3acc2a7c2bc42
UTF-8 谷短遜狸則村谷短遜狸則村B 11101000101100001011011111100111100111111010110111101001100000011001110011100111100010111011100011100101100010011000011111100110100111011001000111101000101100001011011111100111100111111010110111101001100000011001110011100111100010111011100011100101100010011000011111100110100111011001000101000010 e8b0b7e79fade9819ce78bb8e58987e69d91e8b0b7e79fade9819ce78bb8e58987e69d9142
UHC 谷短遜狸則村谷短遜狸則村B 11001101110110111101001110101101111000011110000111010111111000011111011011001110111101011011110111001101110110111101001110101101111000011110000111010111111000011111011011001110111101011011110101000010 cddbd3ade1e1d7e1f6cef5bdcddbd3ade1e1d7e1f6cef5bd42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)