To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??媛??醫??鸚??爰??碎κ?筌??肄 11100010101000110011111100111111100101010101000100111111001111111110011111001110001111110011111111101010010111110011111100111111111000001010011100111111001111111110000111101010100000111100100000111111111000101010001100111111001111111110001111100101 e2a33f3f95513f3fe7ce3f3fea5f3f3fe0a73f3fe1ea83c83fe2a33f3fe3e5
EUC-JP 筌™?媛??醫??鸚??爰??碎κ?筌??肄 111001001010010110001111101000101110111100111111110010011011001000111111001111111110111011010000001111110011111111110011110000000011111100111111111000001010100100111111001111111110001011101100101001101100101000111111111001001010010100111111001111111110011011100111 e4a58fa2ef3fc9b23f3feed03f3ff3c03f3fe0a93f3fe2eca6ca3fe4a53f3fe6e7
UTF-8 筌™뫁媛몌㏊醫롮퐧鸚룸뗄爰쏙㎖碎κ갭筌듭쥙肄 1110011110101101100011001110001010000100101000101110101110101011100000011110010110101010100110111110101110101010100011001110001110001111100010101110100110000110101010111110101110100001101011101110110110010000101001111110100110111000100110101110101110100011101110001110101110010111100001001110011110001000101100001110110010001111100110011110001110001110100101101110011110100010100011101100111010111010111010101011000010101101111001111010110110001100111010111001001110101101111011001010010110011001111010001000001010000100 e7ad8ce284a2ebab81e5aa9bebaa8ce38f8ae986abeba1aeed90a7e9b89aeba3b8eb9784e788b0ec8f99e38e96e7a28ecebaeab0ade7ad8ceb93adeca599e88284
UHC 筌™뫁媛몌㏊醫롮퐧鸚룸뗄爰쏙㎖碎κ갭筌듭쥙肄 1110111110100111101000101110001010010001101001011110101010110000101110001110111110100111101101011110110010100010100011101110110010111101100100001110010110100100101101111110101110110110101111111110101010111010101111011110111110100111101000101110000111101111101001011110101010110000101110001110111110100111101101011110110010100010100011101110110010111101 efa7a2e291a5eab0b8efa7b5eca28eecbd90e5a4b7ebb6bfeababdefa7a2e1efa5eab0b8efa7b5eca28eecbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)