To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 閻??音??碎??n}閻??音??碎??n{^ 1110100010000101001111110011111110001001101110010011111100111111111000011110101000111111001111110110111001111101111010001000010100111111001111111000100110111001001111110011111111100001111010100011111100111111011011100111101101011110 e8853f3f89b93f3fe1ea3f3f6e7de8853f3f89b93f3fe1ea3f3f6e7b5e
EUC-JP 閻??音??碎??n}閻??音??碎??n{^ 1110111111100101001111110011111110110010101110110011111100111111111000101110110000111111001111110110111001111101111011111110010100111111001111111011001010111011001111110011111111100010111011000011111100111111011011100111101101011110 efe53f3fb2bb3f3fe2ec3f3f6e7defe53f3fb2bb3f3fe2ec3f3f6e7b5e
UTF-8 閻띿옋音뗧굨碎댁맄n}閻띿옋音뗧굨碎댁맄n{^ 1110100110010110101110111110101110011101101111111110110010011000100010111110100110011111101100111110101110010111101001111110101010110101101010001110011110100010100011101110101110001100100000011110101110100111100001000110111001111101111010011001011010111011111010111001110110111111111011001001100010001011111010011001111110110011111010111001011110100111111010101011010110101000111001111010001010001110111010111000110010000001111010111010011110000100011011100111101101011110 e996bbeb9dbfec988be99fb3eb97a7eab5a8e7a28eeb8c81eba7846e7de996bbeb9dbfec988be99fb3eb97a7eab5a8e7a28eeb8c81eba7846e7b5e
UHC 閻띿옋音뗧굨碎댁맄n}閻띿옋音뗧굨碎댁맄n{^ 1110011110100010100011011110110010011110100100111110101111100101100010111110011110000010100011101110000111101111101101001110110010010000100111100110111001111101111001111010001010001101111011001001111010010011111010111110010110001011111001111000001010001110111000011110111110110100111011001001000010011110011011100111101101011110 e7a28dec9e93ebe58be7828ee1efb4ec909e6e7de7a28dec9e93ebe58be7828ee1efb4ec909e6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)