To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 闇μ?鎰??兪??n}闇μ?鎰??兪??n{^ 10001000110001011000001111001010001111111110100001001100001111110011111110011001011000000011111100111111011011100111110110001000110001011000001111001010001111111110100001001100001111110011111110011001011000000011111100111111011011100111101101011110 88c583ca3fe84c3f3f99603f3f6e7d88c583ca3fe84c3f3f99603f3f6e7b5e
EUC-JP 闇μ?鎰??兪??n}闇μ?鎰??兪??n{^ 10110000110001111010011011001100001111111110111110101101001111110011111111010001110000010011111100111111011011100111110110110000110001111010011011001100001111111110111110101101001111110011111111010001110000010011111100111111011011100111101101011110 b0c7a6cc3fefad3f3fd1c13f3f6e7db0c7a6cc3fefad3f3fd1c13f3f6e7b5e
UTF-8 闇μ쥜鎰먨쫿兪낆뒪n}闇μ쥜鎰먨쫿兪낆뒪n{^ 111010011001011110000111110011101011110011101100101001011001110011101001100011101011000011101011101010001010100011101100101010111011111111100101100001011010101011101011100000101000011011101011100100101010101001101110011111011110100110010111100001111100111010111100111011001010010110011100111010011000111010110000111010111010100010101000111011001010101110111111111001011000010110101010111010111000001010000110111010111001001010101010011011100111101101011110 e99787cebceca59ce98eb0eba8a8ecabbfe585aaeb8286eb92aa6e7de99787cebceca59ce98eb0eba8a8ecabbfe585aaeb8286eb92aa6e7b5e
UHC 闇μ쥜鎰먨쫿兪낆뒪n}闇μ쥜鎰먨쫿兪낆뒪n{^ 1110010011100001101001011110110010100010100100011110110011110000100100001110010110100110100101101110101011100100100001011110110010001010101001000110111001111101111001001110000110100101111011001010001010010001111011001111000010010000111001011010011010010110111010101110010010000101111011001000101010100100011011100111101101011110 e4e1a5eca291ecf090e5a696eae485ec8aa46e7de4e1a5eca291ecf090e5a696eae485ec8aa46e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)