To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鏑?郵?臍????遠?鏑?郵?臍????遠 1001001101001100001111111001011101011000001111111110010001100000001111110011111100111111001111111000100110010011001111111001001101001100001111111001011101011000001111111110010001100000001111110011111100111111001111111000100110010011 934c3f97583fe4603f3f3f3f89933f934c3f97583fe4603f3f3f3f8993
EUC-JP 鏑?郵?臍??饔?遠?鏑?郵?臍??饔?遠 110001011010110100111111110011011011100100111111111001111100000100111111001111111000111111101000111011110011111110110001111100110011111111000101101011010011111111001101101110010011111111100111110000010011111100111111100011111110100011101111001111111011000111110011 c5ad3fcdb93fe7c13f3f8fe8ef3fb1f33fc5ad3fcdb93fe7c13f3f8fe8ef3fb1f3
UTF-8 鏑렊郵렮臍잴섈饔렧遠타鏑렊郵렮臍잴섈饔렧遠 111010011000111110010001111010111010000010001010111010011000001110110101111010111010000010101110111010001000011110001101111011001001111010110100111011001000010010001000111010011010010110010100111010111010000010100111111010011000000110100000111011011000001110000000111010011000111110010001111010111010000010001010111010011000001110110101111010111010000010101110111010001000011110001101111011001001111010110100111011001000010010001000111010011010010110010100111010111010000010100111111010011000000110100000 e98f91eba08ae983b5eba0aee8878dec9eb4ec8488e9a594eba0a7e981a0ed8380e98f91eba08ae983b5eba0aee8878dec9eb4ec8488e9a594eba0a7e981a0
UHC 鏑렊郵렮臍잴섈饔렧遠타鏑렊郵렮臍잴섈饔렧遠 111011101110101110001110101000011110100111101000100011101011101111110000101100001100000011101010101111001010101011101000101111011000111010110110111010101100000011000101101110001110111011101011100011101010000111101001111010001000111010111011111100001011000011000000111010101011110010101010111010001011110110001110101101101110101011000000 eeeb8ea1e9e88ebbf0b0c0eabcaae8bd8eb6eac0c5b8eeeb8ea1e9e88ebbf0b0c0eabcaae8bd8eb6eac0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)