To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??瓣?日ヂ???牆ヂ???牆??∨〓 001111110011111111100001010000010011111110010011111110101000001101100001001111110011111100111111111000001010110110000011011000010011111100111111001111111110000010101101001111110011111110000001110010011000000110101100 3f3fe1413f93fa83613f3f3fe0ad83613f3f3fe0ad3f3f81c981ac
EUC-JP ??瓣?日ヂ???牆ヂ???牆??∨〓 001111110011111111100001101000100011111111000110111111001010010111000010001111110011111100111111111000001010111110100101110000100011111100111111001111111110000010101111001111110011111110100010110010111010001010101110 3f3fe1a23fc6fca5c23f3f3fe0afa5c23f3f3fe0af3f3fa2cba2ae
UTF-8 룶웬瓣룫日ヂ룫집룫牆ヂ룫집룫牆㈒룵∨〓 111010111010001110110110111011001001101110101100111001111001001110100011111010111010001110101011111001101001011110100101111000111000001110000010111010111010001110101011111011001010011110010001111010111010001110101011111001111000100110000110111000111000001110000010111010111010001110101011111011001010011110010001111010111010001110101011111001111000100110000110111000111000100010010010111010111010001110110101111000101000100010101000111000111000000010010011 eba3b6ec9bace793a3eba3abe697a5e38382eba3abeca791eba3abe78986e38382eba3abeca791eba3abe78986e38892eba3b5e288a8e38093
UHC 룶웬瓣룫日ヂ룫집룫牆ヂ룫집룫牆㈒룵∨〓 1000111110101011110000001010001011110111111110111000111110100010111011001110110110101011110000101000111110100010110000011111110110001111101000101110110111101101101010111100001010001111101000101100000111111101100011111010001011101101111011011010100111000011100011111010101010100001111111011010000111101011 8fabc0a2f7fb8fa2ecedabc28fa2c1fd8fa2ededabc28fa2c1fd8fa2ededa9c38faaa1fda1eb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)