To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏??幼??汲???幽??憶??飮??汲? 10001001010001110011111100111111100101110110001100111111001111111000101110000010001111110011111100111111100101110100100000111111001111111000100110101111001111110011111110011111010110100011111100111111100010111000001000111111 89473f3f97633f3f8b823f3f3f97483f3f89af3f3f9f5a3f3f8b823f
EUC-JP 烏??幼??汲堉??幽??憶??飮??汲堉 1011000110101000001111110011111111001101110001000011111100111111101101011110001010001111101101111111110100111111001111111100110110101001001111110011111110110010101100010011111100111111110111011011101100111111001111111011010111100010100011111011011111111101 b1a83f3fcdc43f3fb5e28fb7fd3f3fcda93f3fb2b13f3fddbb3f3fb5e28fb7fd
UTF-8 烏띻쑈幼싧♤汲堉삯찛幽뚰뮎憶얄댙飮닷♤汲堉 111001111000001110001111111010111001110110111011111011001001000110001000111001011011100110111100111011001000101110100111111000101001100110100100111001101011000110110010111001011010000010001001111011001000001010101111111011001011000010011011111001011011100110111101111010111001101010110000111010111010111010001110111001101000011010110110111011001001011010000100111010111000110010011001111010011010001110101110111010111000101110110111111000101001100110100100111001101011000110110010111001011010000010001001 e7838feb9dbbec9188e5b9bcec8ba7e299a4e6b1b2e5a089ec82afecb09be5b9bdeb9ab0ebae8ee686b6ec9684eb8c99e9a3aeeb8bb7e299a4e6b1b2e5a089
UHC 烏띻쑈幼싧♤汲堉삯찛幽뚰뮎憶얄댙飮닷♤汲堉 111010001010000110001101111010101011111010100100111010101110101010011010111001011010001010111011110100001110001111101011101111001011101111101001101010011001101111101010111010111000110011101101100100101001101111100101111000111011111011100010100010001011110111101011111001101011010011100101101000101011101111010000111000111110101110111100 e8a18deabea4eaea9ae5a2bbd0e3ebbcbbe9a99beaeb8ced929be5e3bee288bdebe6b4e5a2bbd0e3ebbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)