To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????AX?????????B\ 00111111001111110011111100111111001111110011111100111111001111110011111101000001010110000011111100111111001111110011111100111111001111110011111100111111001111110100001001011100 3f3f3f3f3f3f3f3f3f41583f3f3f3f3f3f3f3f3f425c
SJIS-WIN 陋帷鮒闢夂弗迴手弊AX陋帷鮒闢夂弗迴手弊B\ 11101000100110111001101111100111100101011010100111101000100100111001101011100111100101011010010011100111100011111000111011101000100101011011111001000001010110001110100010011011100110111110011110010101101010011110100010010011100110101110011110010101101001001110011110001111100011101110100010010101101111100100001001011100 e89b9be795a9e8939ae795a4e78f8ee895be4158e89b9be795a9e8939ae795a4e78f8ee895be425c
EUC-JP 陋帷鮒闢夂弗迴手弊AX陋帷鮒闢夂弗迴手弊B\ 11101111111110111101011011101001110010101010101111101111111100111101010011101001110010101010011011101101111011111011110011101010110010101100000001000001010110001110111111111011110101101110100111001010101010111110111111110011110101001110100111001010101001101110110111101111101111001110101011001010110000000100001001011100 effbd6e9caabeff3d4e9caa6edefbceacac04158effbd6e9caabeff3d4e9caa6edefbceacac0425c
UTF-8 陋帷鮒闢夂弗迴手弊AX陋帷鮒闢夂弗迴手弊B\ 11101001100110011000101111100101101110001011011111101001101011101001001011101001100101111010001011100101101001001000001011100101101111001001011111101000101111111011010011100110100010011000101111100101101111001000101001000001010110001110100110011001100010111110010110111000101101111110100110101110100100101110100110010111101000101110010110100100100000101110010110111100100101111110100010111111101101001110011010001001100010111110010110111100100010100100001001011100 e9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e6898be5bc8a4158e9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e6898be5bc8a425c
UHC 陋??闢?弗?手弊AX陋??闢?弗?手弊B\ 1101011110110000001111110011111111011100101000110011111111011101110101110011111111100010101000101111100011001001010000010101100011010111101100000011111100111111110111001010001100111111110111011101011100111111111000101010001011111000110010010100001001011100 d7b03f3fdca33fddd73fe2a2f8c94158d7b03f3fdca33fddd73fe2a2f8c9425c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)