To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN ?ろ?????ろ????h?ろ?????ろ? 0011111110000010111010110011111100111111001111110011111100111111100000101110101100111111001111110011111100111111011010000011111110000010111010110011111100111111001111110011111100111111100000101110101100111111 3f82eb3f3f3f3f3f82eb3f3f3f3f683f82eb3f3f3f3f3f82eb3f
EUC-JP ?ろ?????ろ????h?ろ?????ろ? 0011111110100100111011010011111100111111001111110011111100111111101001001110110100111111001111110011111100111111011010000011111110100100111011010011111100111111001111110011111100111111101001001110110100111111 3fa4ed3f3f3f3f3fa4ed3f3f3f3f683fa4ed3f3f3f3f3fa4ed3f
UTF-8 淋ろ쉪梨뺤광淋ろ쉪梨덉쮼h淋ろ쉪梨뺤광淋ろ쉪 11101111101001111011010111100011100000101000110111101100100010011010101011101111101001111010001011101011101110101010010011101010101101001001000111101111101001111011010111100011100000101000110111101100100010011010101011101111101001111010001011101011100011011000100111101100101011101011110001101000111011111010011110110101111000111000001010001101111011001000100110101010111011111010011110100010111010111011101010100100111010101011010010010001111011111010011110110101111000111000001010001101111011001000100110101010 efa7b5e3828dec89aaefa7a2ebbaa4eab491efa7b5e3828dec89aaefa7a2eb8d89ecaebc68efa7b5e3828dec89aaefa7a2ebbaa4eab491efa7b5e3828dec89aa
UHC 淋ろ쉪梨뺤광淋ろ쉪梨덉쮼h淋ろ쉪梨뺤광淋ろ쉪 11101100111110001010101011101101100110101000010011101100101100011001010111101100101100011010010011101100111110001010101011101101100110101000010011101100101100011000100011101100101010001001100001101000111011001111100010101010111011011001101010000100111011001011000110010101111011001011000110100100111011001111100010101010111011011001101010000100 ecf8aaed9a84ecb195ecb1a4ecf8aaed9a84ecb188eca89868ecf8aaed9a84ecb195ecb1a4ecf8aaed9a84

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)