To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鸚??揖??伎逸??音??鴦???B 111010100101111100111111001111111001011101001011001111110011111110001010111010101000100011101101001111110011111110001001101110010011111100111111111010011111000100111111001111110011111101000010 ea5f3f3f974b3f3f8aea88ed3f3f89b93f3fe9f13f3f3f42
EUC-JP 鸚??揖??伎逸??音??鴦???B 111100111100000000111111001111111100110110101100001111110011111110110100111011001011000011101111001111110011111110110010101110110011111100111111111100101111001100111111001111110011111101000010 f3c03f3fcdac3f3fb4ecb0ef3f3fb2bb3f3ff2f33f3f3f42
UTF-8 鸚쒖눦揖쇔럳伎逸닷쩂音쎌돹鴦볃랁뫒B 11101001101110001001101011101100100100101001011011101011100010001010011011100110100011111001011011101100100001111001010011101011100111111011001111100100101111001000111011101001100000001011100011101011100010111011011111101100101010011000001011101001100111111011001111101100100011101000110011101011100011111011100111101001101101001010011011101011101100111000001111101011100111101000000111101011101010111001001001000010 e9b89aec9296eb88a6e68f96ec8794eb9fb3e4bc8ee980b8eb8bb7eca982e99fb3ec8e8ceb8fb9e9b4a6ebb383eb9e81ebab9242
UHC 鸚쒖눦揖쇔럳伎逸닷쩂音쎌돹鴦볃랁뫒B 1110010110100100100111001110110010000111101111011110101111100111101111001110010110001110100100111101000011101011111011001110111110110100111001011010010010011100111010111110010110111101111011001000100110111100111001001110110010010011110100011000110111101101100100011011010001000010 e5a49cec87bdebe7bce58e93d0ebecefb4e5a49cebe5bdec89bce4ec93d18ded91b442

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
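
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function names (to_bit_strings, convert) are hypothetical. The mapping from the table's charset labels to Python codec names is an assumption, in particular cp932 for SJIS-WIN and cp949 for UHC, and individual mappings may differ slightly from the converter used here. Characters that a charset cannot represent are replaced with "?" (byte 3f), which mirrors the 3f runs in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("B").items():
        print(name, binary, hexadecimal)
```

For the single letter "B", every charset listed yields the same byte, 42 in hexadecimal (01000010 in binary), which is why each row of the table ends with 42.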